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BACKGROUND 



Cardiovascular disease is a major health risk througjiout the industrialized world. Coronary 
Artery Disease (CAD) is characterized by atherosclerosis or hardening of the arteries. Atherosclerosis 
is the most preval^t of cardiovascular diseases, is the principal cause of heart attacli; stroke, and 
gangrene of the extremities, and thereby the principle cause of death in the United States. 
15 Atherosclerosis is a complex disease involving many cell types and molecular factors (described in, 
for example, Ross, 1993, Nature 362: 801-809). In normal circumstances a protective response to 
insults to the endothelium and smooth muscle cells (SMCs) of the wall of the artery consists of the 

The advanced lesions of atherosclerosis may occlude the artery concerned, and result from an 
20 excessive inflammatory-fibroproliferative response to numerous difierent forms of insult injury or 
dysfunction of the vascular endothelium is a common feature of many conditions &at predispose an 
individual to accelerated development of atiierosclerotic cardiovascular disease. 

Atherosclerotic plaques occlude the blood vessel concerned and restrict the flow of blood, 
resulting in ischemia. Ischemia is a condition characterized by a lack of oxyg^ supply in tissues of 
25 organs due to inadequate perfusion. Such inadequate perfusion can have a number of natural causes, 
including atherosclerotic or restenotic lesions, anemia, or stroke. The most common cause of 
ischemia in the heart is atherosclerotic disease of epicardial coronary artCTies. By reducing tixe lumen 
of these vessels, atherosclerosis causes an absolute decrease in myocardial perfusion in the basal state 
or limits appropriate increases in perfusion when the demand for flow is augmented. Coronary blood 
30 flow can also be limited by arterial thrombi, spasm, and, rarely, coronary emboli, as well as by ostial 
narrowing due to luetic aortitis. Congenital abnormalities, such as anomalous cnigin of the left 
anterior descending coronary artery from the pulmonary artery, may cause myocardial ischemia and 
infarction in infancy, but this cause is very rare in adults. 
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Myocardial ischemia can also occur if myocaidial oxygen demands arc abnomially increased, 
as in severe ventricular hypertrophy due to hypertension or aortic stenosis. The latter can be present 
with angina that is indistinguishable from tiiat caused by coronary atherosclerosis. A reduction in the 
oxygen-carrying capacity of ^ blood, as in extremely severe an^nia or in the presmce of caiboxy- 
5 hemoglobin, is a rare cause of nQrocardial ischemia. Not infrequently, two or more causes of ischemia 
will coexist, such as an increase in oxygen demsnd due to left voitricuiar liypertrophy and a reduction 
in oxygen siqjply secondary to corOTaxy athCTOSclerosis. 

Extensive clinical studies have identified ^tors that increase the risk of cardiovascular 
disorders. Some of these risk factors, such as age, gender, and family history cannot be changed, 
to Other risk factors include the following: smoking, hig^ blood pressure, hig^ fat and high cholesterol 
diet, diabetes, lack of exo^se, obesity, and stress. 

Fortunately, many contributing factors are controllable tfarou^ lifestyle changes. The risk of 
cardiovascular disorders for sn:u>k^s is more than twice that of non-smokers. When a perscm stops 
smoking, regardless of how much he or she may have smoked in the past, their risk of developing a 
15 disorder rapidly declines. Serum cholesterol level is directly related to prevalence of cardiovascular 
disorder and hypertension or high blood pressure is an important risk factor. Physical activity has 
beoi postulated to reduce the risk of developing a cardiovascular disorder througji various 

myocardial contraction and its electrical impulse stabiUty. Reduced oxygoi demand and myocardial 
20 work are reflected in lowered heart rate and blood pressure at rest Physical activity also increases the 
diameter and dilatory capacity of coronary arteries, increases collateral artery formati(m, and reduces 
rates of progression of coronary artery atherosclerosis. Obesity and the serum fatty acids are reduced 
by activity. 

There may be no noticeable symptoms of a cardiovascular disorder at rest, but symptoms such 
25 as chest pressure may occur with increased activity or stress. Other first signs that can appear are 

heartburn, nausea, vomiting, numbness, shortness of breath, heavy cold sweating, unexplained &tigue, 
and feelings of anxiety. The more severe symptoms of cardiovascular disorders are chest pain (angina 
pectoris), rhytimi disturbances (arrhythmias), stroke, or h^rt attack (myocardial infarction). Strokes 
and h^irt attacks r^ult &om a blocked arteiy in the brain and heart tissue, respectively. Because 
30 synq>toms vary, the tests and treatments chosen can be very differ^t from one patient to another. 

Diagnostic tests useful in determining the extmt and severity of cardiovascular disorder 
include: electrocardiogram (EKG), stress test, nuclear scanning, coronary angiography, resting £KG, 
EKG Multiphase friformation Diagnosis Index^, Holter monitor, late potentials, EKG mapping, 
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There are many different tiberapeutic approaches, depending on the seriousness of tixe disease. For 

many people, cardiovascular disorders are managed with lifestyle changes and medications. More 
5 severe diagnoses may indicate a need for surg^. 

Surgical approaches to the treatment of ischemic atherosclerosis include bypass grafting, 

coronary angioplasty, laser angioplasty, atherectomy, endarterectomy, and percutaneous translumenal 

angioplasty (PCTA). The failure rate after litese approaches due to restenosis, in which the occlusions 

recur and often become even worse, is extraordinarily hig^ (30-50%). It appears that much of the 
10 restenosis due to further inflammation, smooth muscle accumulation, and thrombosis. Additional 

therapeutic approaches to cardiovascular disease have included treatments that encouraged 

angiogenesis in such conditions as ischemic heart and limb disease. 

The non-specific nature of most CAD and cardiovascular disorder syn^toms makes definitive 

diagnosis difficult. More quantitative diagnostic methods suSer from variability, both between 
15 individuals and between readings on a sin^e individual. Thus, diagnostic measures must be 

standardized and applied to individuals wi& well-documented and extensive medical histories. 

Further, current diagnostic methods often do not reveal the underlying cause for a given observation 



expression profilirsg. For exanq>le, genes tiiat are known to be involved in cardiovascular disorders 



hybridization-based techniques, or PGR. In anoth^ exan:q}le, egression from a known gene may be 
tracked by standard techniques including RITCR, various hybridization-based techniques, and 

25 sequencing. These strategies often do not enable a practitioner to detect differences in n[]RNA 

processing and splicing, translation rate, mRNA stability, and posttranslational modifications such as 
proteolytic processing, phosphorylation, glycosylation, and amidation. 

To address the current weaknesses in the diagnostic state of the art for cardiovascular 
disorders, the invmtion provides specific plasma polypeptides that are differentially increased in 

30 plasma &om individuals with Coronary Artery Disease compared to control plasma. By providing the 
actual polypeptide ^ecies, di^erences in oaRNA processing and splicing, translation rate, mRNA 
stability, and posttranslational modifications such as proteolytic processing, phosphoiylation, 
glycosylation, and anddation are revved. The polypeptides of the invention are thus described as 
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may be screened for mutations using c 



genotyping techniques such as sequencing, 
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"Cardiovascular disordo- Plasm Polypeptides" CPPs. These polypeptide sequences are described 
asSEQIDNOs:1.2,6-7, 11-12, 15-17, and 24-25, and tboseconq)rising at least one of the anm 
sequences selected from SEQ ID NOs:3-5, 8-10, 13-14, 18-23 and 26-28 (see Figures 1-5). These 
polypeptides are more precisely refored to as "Cardiovascular disorder Plasma Polypeptides 2, 9, 17, 

20 and 21" (CPP 2 corresponding to SEQ ID NOs:l-5, CPP 9 corresponding to SEQ ID NOs:6.10, 
CPP 17 corresponding to SEQ ID NOs: 15-23. CPP 20 corresponding to SEQ ID NOs:24-28 and CPP 

21 corresponding to SEQ ID NOs:ll-14), and include fragments, and post-translationally modified 
species of CPPs that are present at a higher level in plasma obtained from individuals wi& Coronary 
Artery Disease (CAD). PrefOTed fragments of the invention are those described as SEQ ID NOs:3-5, 
8-10, 13-14, 18-23 and 26-28. Thus, the CPPs of the invention represent an in^ortant diagnostic tool 
for determinmg the risk of coronary artey disease (CAD), coronaxy heart disease (CHD), peripheral 
vascular disease, c&c^tbI ischemia (stroke), congestive heart failure, atherosclerosis, hypertension, 
and o&er cardiovascular diseases. CPPs are secreted factors and as such, are readily detectable and 
useful for drug development, diagnosis, and prevention of cardiovascular diseases. 

SUMMARY OF THE INVENTION 

The present invention is directed to compositions related to secreted polypeptide species that 

polyp^tide species are designated herein ^Carfflovascular disordo: Plasma Polypeptides," or CPPs. 
Such Cardiovascular disord^ Plasma Polypeptides comprise an amino acid sequence selected from 
one of the gcoups consisting of SEQ ID NOs:l-5, 6-10, 1 1-14, 15-23 and 24-28, and are designated 
CPP 2, CPP 9, CPP 17, CPP 20 and CPP 21, respectively. Compositions include CPP precursors, 
antibodies specific for CPPs, including monoclonal antibodies and other binding compositions 
derived therefrom. Furths* included are metiiods of making and using these con^ositions. Precursors 
of the invention include unmodified precursors, proteolytic precursors of SEQ ID NOs: 1-28, and 
mtermediates resulting fixmi alternative proteolytic sites in the amino acid sequraces of SEQ ID 
NOs: 1-28. 

A preferred embodiment of the invention includes CPPs having a posttranslational 
modification, such as a phosphorylation, giycosyiation, aoetylation, amidaticm, or a C-, N- or O- 
linked carbohydrate group. Additionally preferred are CPPs wife intra- or inter-molecular 
interactions, e.g., disulfide and hydrogen bonds that result in higher order structures. Also preferred 
are CPPs that result from differential mRNA processing or splicing. Preferably, the CPPs represent 
post-*ranslationally modified species, structural variants, or splice variants that are present in plasma 
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from individuals with a cardiovascular disorder. 

In another aspect, the invention includes CPPs con^Jiising a sequence which is at least 75 
pax:ent identical to a sequence selected from one of the groiq)s consisting of SEQ ID NOs:l-5, and 
1 1-23. Preferably, the invention includes polypeptides comprising at least 85 percent, and more 
preferably at least 90 perc«t, and still more pref^bly at least 95 pment, identity with any one of 
the sequences selected from SEQ ID NOs: 1-5, and 1 1-23. Most ixeferably, the invention include 
polypeptides comprising a sequence at least 99 p^ent identical to a sequence selected from one of ' 
the groups consisting of SEQ ID NOs: 1-5, and 1 1-23. 

In anotiier aspect, the invention includes CPPs comprising a sequence which is at least 85 
p^x^ent identical to a sequence selected from the g^oup consisting of SEQ ID NOs:6-10. Preferably, 
the invention includes polypeptides conqnising at least 90 percent, and more preferably at least 95 
Xyercent, and still more preferably at least 97 pencrait, identity with any one of the sequences selected 
from SEQ ID NOs:6-10. Most prefmbly, the invention includes polypeptides conqirising a sequence 
at least 99 percent idaitical to a sequ«ice selected from tiie group consisting of SEQ ID NOs:6-10. 

In ano^r aspect, the invention includes CPPs con:^>rising a sequence which is at least 95 
percCTt identical to a sequence selected from the group consisting of SEQ ID NOs:24-28. Preferably, 
the invention include polypeptides comprising at least 97 percmt, and more preferably at least 98 

from SEQ ID NOs:24-28. Most preferably, the invention includes polypeptides conpising a 
sequence at least 99 percent identical to a sequaice selected from the group consisting of SEQ ID 
NOs:24-28. 

In another aspect, ^ invention includes natural variants of CPPs having a frequency in a 
selected population of at least two percent More preferably, such natural variant has a frequency in a 
selected population of at least five percrat, and still more preferably, at least ten p^cent Most 
preferably, such natural variant has a frequraicy in a selected population of at least twenty percent 
The selected population may be any recognized population of study in the field of population 
genetics. Preferably, the selected population is Caucasian, Negroid, or Asian. More prefi^ably, the 
selected population is French, German, English, Spanish, Swiss, Japanese, Chinese, Irish, Korean, 
Smgqjorean, Icelandic, Nor& American, laraeH, Arab, Turkish, Greek, Italian, Polish, Pacific 
Islander, Finnish, Norwegian, Swedish, Estonian, Austrian, or IndiaiL More preferably, the selected 
population is Icelandic, Saami, Finnish, French of Caucasian ancestry, Swiss, Singaporean of Chinese 
ancestry, Korean, Japanese, Quebecian, North American Pima Indians, Pennsylvanian Amish and 
Amish Mennonite, Newfoundlander, or Polynesian. 
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A preferred aspect of the invCTtioii |»x>vides a conpositioxi coznprising an isolated CPP, i.e., a 
CPP jfree Cram proteins or protein isoforms having a significantly different isoelectric point or a 
significantly different apparent molecular weig^ from the CPP. The isoelectric point and molecular 
weight of a CPP may be indicated by afOniiy and size-based separation chromatography, 2- 
dimensional gel analysis^ and mass spectrometiy. 

hi a preferred aspect, the invention provides particular polypq>tide species that comprise an 
amino acid sequsice selected from the one of the groiq)s consisting of SEQ ID NOs:3-5, 8-10, 13-14, 
18-23 and 26-28. Preferably, Ihe particidar polypq}tide species further comprises contiguous amino 
acid sequence from SEQ ID NOs: 1-2, 6-7, 11-12, 15-17, and 24-25, respectively. Preferred species 
are pols^peptides &at i) comprise an amino acid sequence selected from one of the groups consisting 
of SEQ ID NOs: 3-5, 8-10, 13-14, 18-23 and 26-28; ii) appear at a hi^r level m plasma fi-om 
individuals with, a cardiovascular disorder and iiO optionally result from proteolytic processing of the 
polyp^tides of SEQ ID NO: 1-2, 6-7, 1 1-12, 15-17, and 24-25, respectively. 

In a still finth^ aspect, the invention provides a combination of two or more of the 
polypq>tides selected from His one of the groups consistmg consisting of SEQ ID N0s:l-5, 6-10, 1 1- 
14, 15-23 and 24-28. 

In an additional aspect, the invention includes modified CPPs. Such modifications include 

labels, such as an enzymatic, fluorescent, isotopic or afBnity label to allow for detection and isolation 
of the protein. Chemical modifications may be carried out by known techniques, including but not 
limited, to specific chemical cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8 
protease, NaBH4, acetylation, formylation, oxidation, reduction, or metabolic synthesis in tiie 
presmce of tunicamycin. 

Also provided by &e invoition are chemically modified derivatives of &e polypeptides of the 
invention which may provide additional advantages such as increased solubihty, stability and 
circulating time of the polypeptide, or decreased inmiunogenicily (e.g., water soluble polymers such 
as polyethylene glycol, ethylene glycol/propylene glycol copolymers, carboxymefeylcellulose, 
dextran, polyvinyl alcohol). The CPPs are modified at random positions within the molecule, or at 
predetermined positions within fb& molecule and may include one, two, three or more attached 
chemical moieties. 

In another embodiment, the invention provides a method of idmiifyhxg a modulator of at least 
one CPP biological activity comprising the steps of: 0 contacting a test modulator of a CPP biological 
activity with the polypeptide conqjrising the amino acid sequence selected from one of the groixps 
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consisting of flie amino acid sequences listed in Table 1 (corresponding to CPP 2, CPP 9, CPP 17, 
CPP 20 and CPP 21); ii) detecting the level of said CPP biological activit)^ and iii) comparing the 
level of said CPP biological activity to that of a control sanq}le lacking ^d test modulator. Where 
the difference in the level of CPP protein biological activity is a decrease, die test modulator is an 
inhibitor of at least one CPP biological activity. Where the difference in the level of CPP biological 
activity is an increase, the test substance is an activator of at least one CPP biological activity. For 
CPP 2, the CPP biological activity tested is preferably increasing / decreasing lipase activity. 

Another aspect of the invention relates to a method of identifying a modulator of a 
cardiovascular disorder comprising the steps of: (a) administering a candidate agent to a non-hmnan 
test animal which is predisposed to be affected or which is affected by the cardiovascular disorder; (b) 
administmng the candidate agent of (a) to a matched control non-human animal not predisposed to be 
affected or not being affected by fhs cardiovascular disorder; (c) detecting and /or quantifying the 
level of at least one polypeptide in a biological san^^le obtained horn the non-bimxan test of (a) or 
control aninml of (b), wherein flie at least one polypeptide is selected from: (i) a polypeptide 
comprising the amino acid sequence selected from one of the groups consisting of SEQ ID NOs: 1-2, 
6-7, 1 1-12, 15-17, and 24-25; (ii) a variant, witii at least 75% sequence identity, having one or more 
amino acid substitutions, deletions or insertions relative to the amino acid sequence shown in SEQ ID 

amino acid substitutions, deletions or insertions relative to the amino acid sequence shown in SEQ ID 
NOs:6, or 7; (iv) a variant, with at least 95% sequence identity, having one or more amino acid 
substitutions, deletions or insertions relative to the amino acid sequence shown in SEQ ID NOs:24, or 
25; and (v) a fragment of a polypeptide as defined in i), ii), iii), or iv) above which is a least ten amino 
acids long; and step (d) comparing the level of the at least one polypeptide of st^ (c); wherein an 
alteration in tiie level of die polypeptide indicates that the candidate agent is a modulator of the 
cardiovascular disorder. A preferred embodiment of the invention provides that the non-human test 
animal which is predisposed to be affected or which is affected by the cardiovascular disorder 
con^rises an increased plasma level of at least one of the polypeptides selected from: (i) a 
polypeptide comprising the amino acid sequence selected from one of the groups consisting of SEQ 
ID NOs: 1-2, 6-7, 1 1-12, 15-17, and 24-25; (ii) a variant, witii at least 75% sequence identity, having 
one or more amino acid substitutions, deletions or insertions relative to tise amino acid sequence 
shown in SEQ ID NOs: 1, 2, 1 1, 12, 15, 16, or 17; (iii) a variant, witii at least 85% sequence identity, 
having one or more amino acid substitutions, deleti<ms or insertions relative to the mymn acid 
sequence shown in SEQ ID N0s:6, or 7; (iv) a variant, with at least 95% sequence identity, having 
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one or more amino acid substitutions, deletions or insertions relative to the anuno acid ^quence 
shown in SEQ ID NOs:24, or 25; and (v) a fiagment of a polypeptide as defined in i), ii), iii), or iv) 
above which is a least ten amino acids long. 

in a still ftirther aq>ect of die invention, a mediod for monitoring the efficacy of a treatmmt of 
a subject having or at ri^ of developing a ^rdiovascular disorder with an agent is provided, ^)^ch 
conqmses steps: (a) obtaining a pre-administration biological sanq>le from the subject prior to 

istration of the agent; (b) detecting and /or quantifying the level of at least one polypeptide in 
the biological sanq)Ie from said subject, wherein the at least polyp^tide is selected from: (i) a 
polypeptide con^nising the amino acid sequence selected from one of die groups consisting of SEQ 
n> NOs:l-2, 6-7, 1 1-12, 15-17, and 24-25; (ii) a variant, widi at least 75% sequmce identity, having 
one or more amino acid substitutions, deletions or insertions relative to the amino acid sequrace 
shown in SEQ ID NOs:l, 2, 1 1, 12, 15, 16, or 17; (iii) a variant, with at least 85% sequmce identity, 
having one or more amino acid substitutions, deletions or insertions relative to the amino acid 
sequence shown in SEQ ED N0s:6, or 7; (iv) a variant, with at l^st 95% sequence idoitity, having 
one or more amino acid substitutions, deletions or insertions relative to the amiao acid sequence 
shown in SEQ ID NOs:24, or 25; and (v) a fragment of a polypeptide as defined in i), ii), iii), or iv) 
above which is a least ten amino acids long; and steps (c) obtaining one or more post-adnunistration 



admimstration san^le or samples; (e) conqjaring the level of the at least one polypeptide in the pre- 
administration sample with the level of the at least one polypeptide in the post- administration sample; 
and (Q adjusting the administration of the agent accordingly. 

In anodier aspect, the invention includes polynucleotides encoding a CPP of the invention, 
polynucleotides encoding a polypeptide having an amino acid sequence selected from one of the 
grciq)s consisting of SEQ ID NOs: 1-5, 6-10, 1 1-14, 15-23 and 24-28, oligonucleotides 
conq)lementaiy to CPP gene sequences for diagnostic and analytical assays (e.g., PGR, hybridization- 
based techniques), and vectors for expressing CPPs. 

hi ano&er aspect, the invention provides a vector conq^rising DNA encoding a CPP. The 
inv^tion also includes host cells and transgenic non-human animals comprising such a vector. 
Hiere is also provided a me&od of making a CPP or CPP precursor. One preferred method comprise 
the steps of (a) providing a host cell containing an expression vector as disclosed above; (b) culturing 
the host cell under conditions whereby the DNA segment is expressed; and (c) recovering the protein 
encoded by the DNA segment Another preferred method comprises the steps of: (a) providing a host 
cell capable of expressing a CPP; (b) culturing said host cell under conditions that allow expression of 
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said CPP; and (c) reoovering said CPP. Within one embodiment the expression vector fur&i^ 
comprises a secretory signal sequence operably linked to the DNA segment, the cell secrete the 
protein into a culture medium, and the protein is recovered finom fte mediimi. An especially prefored 
me^d of making a CPP include chemical syirdiesis using standard peptide synthesis techniques, as 
described in the section titled ''Chemical Manu&cture of CP? Con^jositions'* and in Example 2. 

In another aspect, the invention includes isolated antibodies specific for any of the 
polypeptides, peptide fragments, or peptides described above. Preferably, the antibodies of the 
invention are monoclonal antibodies. Further fnieferred are antibodies that bind to a CPP exclusively, 
that is, antibodies that do not recognize other polypeptides with high affinity. Anti-CPP antibodies 
have purification, diagnostic and prognostic applications. PrefCTed anti-CPP antibodies for 
purification and diagnosis are attached to a label group. Preferred CPP-related disoidcrs for diagnosis 
include coronary artery disease (CAD), coronary heart disease (CHD), peripheral vaanilar disease, 
cerdiral ischemia (stroke), congestive heart &ilure, a1hax>sclerosis, hypertension, and other 
cardiovascular diseases. Diagnostic methods include, but are not limited to, those that employ 
antibodies or antibody-^lmved conq^ositions specific for a CPP antigen. Diagnostic methods for 
detecting CPPs in specific tissue sanqples and biological fiuids (prefoably plasma), and for detecting 
levels of expression of CPPs in tissues, also form part of the invration. Con^sitions coursing 

^within the scope of the invention, for example, for in vivo diagnosis and drug scre^iing assays. 

Hie invCTtion fiirther provides metiiods for diagnosis of cardiovascular disorders that 
comprise detecting in a sample of body fluid, preferably blood plasma, the presence or level of at least 
one CPP disclosed herein or any combination thereof. Further included are methods of using CPP 
compositions, including primers conq>lem6ntary to CPP genes and/or messenger RNA and anti-CPP 
antibodies, for detecting and measuring quantities of CPPs in tissues and biological fluids, preferably 
plasma. These methods are also suitable for clinical screening, prognosis, monitoring tiie results of 
th^py, identifying patients most likely to respond to a particular ther^eutic treatment, drug 
screening and development, and identifying new targets for drug treatment. 

The invention provides kits that may be used in the above-recited methods and that may 
comprise smgle or multiple prqjarations, or antibodies, together with other reagents, label groups, 
substrates, if needed, and directions for use. The kits may be used for diagnosis of disease, or may be 
assays for the identification of new diagnostic and/or ther^eutic agmts. 

In one embodiment. Coronary Artery Disease (CAD) is defined by the appearance of at least 
one syn^tom. Such symptoms become more serious as the disease progresses. CAD is often 
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acconQ)anied by reduced left vratricle capacity or ou^ut Early CAD symptoms include elevated 
plasma levels of cholesterol and low-density lipoprotein (especially oxidized forms), as well as 
platelet-rich plasma aggregations. The vascular endothelium responds to inflammation and thus 
formation of plaques and levels of inflammatory and fibrinogenic factors increase, hi addition, CAD, 
or ath^sclerosis, is characterized by vascular calcification and hardening of the arteries. The 
resulting partial occlusion of the blood vessels leads to hypertensicm and ischemic heart disease. 
Eventual conqilete vascular occlusion results in myocardial infarction, stroke, or gangrene. 

hi a preferred embodiment, detection of increased plasma levels of at least one CPP of Ihe 
invention indicates an increased risk that an individual will develop CAD. Preferably, said detection 
indicates that an individual has at least a l.OS-foId, LI -fold, 1.15-fold, and more pref^ably at least a 
1.2-fold increased likelihood of developing CAD. Alternatively, detection of increased plasma levels 
of at least one CPP of the invention indicates that an individual has CAD. The anu^unt of CPP 
mcrease observed in an individual compared to a control ^nq^le will correlate with the certainty of 
the prediction or diagnosis of CAD. As individual plasma CPP levels will vary depending on femily 
history and other risk fectors, each wiD preferably be examined on a case-by-case basis. In preferred 
embodimaits, CPP is detected m a human plasnm sample by the methods of the mvention. Especially 
preferred techniques are mass spectrometry and immunodetection. Preferably, a prediction or 



in the experimental CPP level as conq>ared to the control. 

The invCTtion fiirther includes methods of using CPP-modulating compositions to prevrat or 
treat disorders associated with aberrant expression or processing of CPPs of SEQ ID NOs: 1-28 in an 
individual. Preferred CPP-related disorders include coronary artery disease (CAD), coronary heart 
disease (CHD), peripheral vascular disease, cerebral ischemia (stroke), congestive heart failure, 
atherosclerosis, hypertension, and other cardiovascular diseases. A preferred embodiment of the 
invention is a mefliod of prevrating or treating a OPP-related disorder m an individual conqjrising the 
steps of: determining that an individual suffers firom or is at risk of a CPP-related disord^ and 
introducing a CPP-modulating composition to said individual. 



Further aspects of the invention are also described in the specification and in the claims. 



BRTRy nFSrRTP TION OF THE SEOTJENfTF. T.TsymVG 
SEQ ID NOs: 1 and 2 describe the amino acid sequences of the polypeptides present m plasma 
san^les of mdividuals with coronary artery disease (h^einafter, SEQ ID NO;2 is designated CPP 2). 
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SEQ ID NOs:3-S are the amino acid sequences of Hyptic peptides found in MS-MS and'or 
MS-MALDI mass spectrometry in plasma samples of individuals with caronary artery disease. 

SEQ ID NO:6 desCTibes the amino acid sequence of eosinophil-derived neurotoxin (EDN, 
hereinafter CPP 9), ^^^lereas SEQ ID NO:7 is the polypeptide sequence of the mature protein. 

SEQ ID NOs:8, 9, and 10 are the amino acid sequences of tryptic peptides found by MS-MS 
mass spectrometry in plasma samples of individuals with coronary artery disease. 

SEQ ID NO: 1 1 describes the amino add sequence of Human ^ididymal secretory pvrotein 
(HE) 1, whereas SEQ ID NO: 12 is the polypqrtide sequence of the mature protein (ha-einafter, CPP 
21). 

SEQ ID NOs: 13-14 are the amino acid sequences of tiyptic p^tides found by MS-MS mass 
spectrometry at a higher level in plasma san^les of individuals with COTonary artery disease. 

SEQ ID NO: 15 desCTbes the amino acid sequence of Defensin 1 precursor, whereas SEQ ID 
NO: 16 is the polypeptide sequence of the preprotein and SEQ ID NO: 17 is the sequence of Defensin 1 
(hereinafter, CPP 17). 

SEQ ID NOs: 1 8-23 are the amino acid sequences of tiyptic peptides found by MS-MS mass 
spectrometzy predominantly in plasnm sanq^les of individuals with coronary artery disease. 
SEQ ID NO:24 describes the amino acid sequence of Plasminogen-related protein B 

CPP 20). 

SEQ ID NOs:26-28 are the amino acid sequ^oces of tryptic peptides found by tandem mass 
spectrometry in plasnm samples of individuals with Coronary Artery Disease. 

BRIEF DESCRIPnON OF TEfF CTCT TRF. 
Figure I shows mature human Colipase polypeptide sequences (SEQ ID NOs:l and 2) and the 
sequences of the tryptic pq>tides identified by tandem mass spectrometry in the plasma of individuals 
with coronaiy artery disease (SEQ ID NOs:3-5), The tiyptic peptides are underlined in SEQ ID 
NOs:l and 2. 

Figure 2 shows the sequence of CPP 9 (SEQ ID NO: 6) and the peptide sequences foimd by 
MS-MS mass spectrometiy in the plasma of individuals with coronary artery disease (SEQ ID NOs:8- 
10). The tryptic peptides observed by tandem mass spectrometry are imderlined in SEQ ID NOs:6 and 
7. The signal peptide is double-underlined. 

Figure 3 shows tiie sequence of the precursor protein (SEQ ID NO:l 1) and of the mature 
protein (SEQ ID NO: 12, CPP 21) and pcptidst sequ«ices found by MS-MS mass spectrometry m the 
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plasma of individuals witii coronary artery disease (SEQ ID NOs: 13-14). The tryptic peptides 
observed by tandem mass spectrometry are underlined in SEQ ID NOs: 1 1 and 12. The signal peptide 
is double-undoiined. 

Figure 4 shows the sequence of CPP 17 (SEQ ID NO: 17) and the peptide sequences found by 
MS-MS mass spectrometry in the plasma of individuals with coronary artery disease (SEQ ID 
NOs: lS-23). The tryptic peptides observed by tandem mass spectrometry are underlined in SEQ ID 
NOs: 15 and 16, which r^xresent the precursor and preprotein, respectively. The signal peptide is 
double-underlined in SEQ ID NO: 15. 

Figure 5 shows the sequence of the precursor (SEQ ID NO:24) and mature protein (CPP20, 
SEQ ID NO:25) of the invention, and peptide sequences found by tandem mass spectrometry in the 
plasma of individuals with Coronary Artery Disease (SEQ ID NOs:26-28). The tryptic peptides 
observed by tandem mass spectrometry are underlined in SEQ ID NOs:24 and 25. The signal peptide 
is double-underlined in SEQ ID NO:24. 

mr.TAn.l<n DESCRIPTION OF TRF TXWlVTIQN 
The present invention described in detail below provides methods, con:q>ositions, and kits 
useful for screening, diagnosis, and prognosis of a cardiovascular disorder in a mammalian individual; 

the results of cardrovascular disorder therapy; for screaiing CPP modulators; and for drug 
development. The invention also encompasses the administration of therapeutic corrqfx>sitions to a 
mamm a li a n individual to treat or prevent cardiovascular disorders. The maimnalian individual may be 
a non-human mammal, but is preferably human, more preferably a human adult. For clarity of 
disclosure, and not by way of limitation, tiie invention will be described widi respect to the aimlysis of 
blood plasma ^mples. However, as one skilled in the art wiU appreciate, the assays and techniques 
described below can be applied to other biological fluid sanq>les (e.g. cerebrospinal fluid, lynqsh, bile, 
serum, sahva or urine) or tissue samples from an individual at risk of having or developing a 
cardiovascular disorder. ThQ methods and con^ositions of the present invention are useful for 
screening, diagnosis and prognosis of a living individual, but may also be used for postmortem 
diagnosis in an individual, for example, to identify &mily members ^o are at risk of developing the 
samedisOTd^. 

D^initions 

As used herein, the term "nucleic acids" and '"nucleic acid molecule" is intended to include 
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DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g^ mRNA) and analogs of the 
DNA or RNA graerated using nucleotide analogs. The nucleic acid molecule can be single-stranded 
or double-stranded, but preferably is double-stranded DNA Throughout the present specification, the 
expression ''nucleotide sequence" may be employed to designate indifferently a polynucleotide or a 
nucleic acid. More precisely, the esqiression "nucleotide sequence" encompasses the nucleic material 
itself and is thus not restricted to the sequence infonnation (i.e. the succession of letters chosen 
among the four base letters) that biochemically characterizes a specific DNA or RNA molecule. Also, 
used interchangeably herein are terms "nucleic acids", "oligonucleotides", and "polynucleotides". 

An "isolated" nucleic acid molecule is one which is separated from other nucleic acid 
molecules ^niiich are present in the natural source of the nucleic acid. Prefraibly, an "isolated" nucleic 
acid is free of sequences which naturally flank the niicleic acid (i.e., sequences located at the 5' and 3' 
ends of the nucleic acid) in the genomic DXA of the organism from which the nucleic acid is derived. 
For exanqile, in various embodiments, &e isolated CPP nucleic acid molecule can ccmtain less than 
about 5 Id), 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0. 1 kb of nucleotide sequence which naturaUy flank the 
nucleic acid molecule in genomic DNA of the cell frxrm which the nucleic acid is dmved. Moreov^, 
an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of other 
cellular material, or culture mediimi whsn produced by recombinant techniques, or substantially free 

nucleic acid as a hybridization probe, CPP nucleic acid molecules can be isolated using standard 
hybridization and cloning techniques (e.g., as desCTibed in Sambrook, J., Fritsh, E. F., and Maniatis, 
T. Molecular Qoning. A Laboratory Manual. 2nd, ed.. Cold Spring Harbor Laboratory, Cold Spring 
Haibor Laboratory Press, Cold Spring Harbor, N.Y., 1989). 

As used herein, the term 'liybridizes to" is intended to describe conditions for moderate 
stringency or high stringency hybridization, preferably where the hybridization and washing 
conditicms permit nucleotide sequences at least 60% homologous to each other to remain hybridized 
to each other. Preferably, die conditions are such that sequences at least about 70%, more preferably 
at least about 80%, even more preferably at least about 85%, 90%, 95% or 98% homologous to each 
other typically remain hybridized to each other. Stringent conditions are known to those skilled in the 
art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3. 1-6.3.6. In a preferred, non-limiting exsanple, stringent hybridization ccmditions for nucleic acid 
interactions are as follows: the hybridization step is realized at 65*'C in the presence of 6 x SSC 
buffer, 5 x Denhardt*s solution, 0,5% SDS and lOOfxg^il of salmon sperm DNA The hybridization 
step is followed by four washing steps: 



13 



wo 2004/090551 



PCT/EP2004/003737 



- two washings during 5 mm, preferably at 65X in a 2 x SSC and 0.1%SDS buffer, 

- me washing during 30 min, prefCTably at 65*^0 in a 2 x SSC and 0.1% SDS buffer, 

- one washing during 10 min, pre£mbly at 6S''C in a 0.1 x SSC and 0.1%SDS buffer, 

these hybridization conditions being suitable for a nucleic acid molecule of about 20 nucleotides in . 
IcngftL it will be appreciated that the hybridization conditions desCTibed above are to be ads^ted 
according to the length of the desired nucleic acid, following techniques well known to the one skilled 
in the art, for example be adapted according to the teachings disclosed in Hames B JD. and Higgins 
S J. (1985) Nucleic Acid Hybridization: A Practical Approach. Hames and Higgins Ed., IRL Press, 
Oxford; and Current Protocols in Molecular Biology. 

Percent homology" is used herein to refer to both nucleic acid sequences and amino acid 
sequences. Amino acid or nucleic acid "identify" is equivalrat to amino acid or nucleic acid, 
"homology". To determine the percent homology of two amino acid sequ^ces or of two nucleic 
acids, the sequences are aligned for optimal conq>arison purposes (e.g., gaps can be introduced in the 
sequence of a first amino acid or nucleic acid sequence for optimal alignment with a second amino or 
nucleic acid sequmce and nonhomologous sequences can be disregarded for con^arison purposes). 
The length of a reference sequence aligned for con^arison purposes is at least 30%, preferably at 
least 40%, more preferably at least 50%, even more preferably at least 60%, and evra more preferably 

nucleotides at correspondmg amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino acid residue or nucleotide as the 
corresponding position in die second sequence, then the molecules are homologous at that position. 
The percent homology between the two sequences is a function of the number of identical positions 
shared by the sequences (i.e., % homology=# of idoitical positions/total # of positions 100). 

The conq)arison of sequences and determination of percmt homology between two sequences 
can be acconsplished using a mathematical algorithm. A preferred, non-limiting exanqile of a 
mathematical algorithm utilized for the comparison of sequences is the algorithm of Karhn and 
Altschul (1990) Proc. Natl. Acad, Sci. USA 87:2264-68, modified as in Karlin and Altschul (1993) 
Proc. Nad. Acad Sci. USA 90:5873-77, tiie disclosures of which are incorporated herein by reference 
in their entireties. Such an algoritimi is incorporated into the NBLAST and XBLAST progran^ 
(version 2.0) of Altschul, et al, (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be 
performed with the NBLAST program, score=100, wordlength=12 to obtain nucleotide sequences 
homologous to the sequences of the invention. BLAST protein searches can be p^ormed with the 
XBLAST program, score=50, wordlCTgth=3 to obtain amino acid sequences homologous to the 
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polypeptide sequences of tiie invention. To obtain gapped alignments for conq}aiison purposes. 
Gapped BLAST can be utilized as described in Altschul et aL, (1997) Nucleic Acids Research 
25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, fee defeult parameters of 
the respective programs (e.g., XBLAST and NBLAST) can be used. See 

fat^://www.ncbi.nhnjnh.gov, the disclosures of which are incorporated herein by reference in their 
aitireties. Another preferred, non-linutrug exan5)le of a mathematical algorithm utilized for the 
comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989), the disclosures of 
\^4lich are incorporated herein by reference in their entireties. Such an algorithm is incorporated into 
the ALIGN program (version 2.0) which is part of the GCG sequ^ce ahgmnent software package. 
When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weigfht residue 
table, a gap Iragth penalty of 12, and a gap poialty of 4 can be used. 

The term '*polypqjtide" refers to a polymer of amino acids witliout regard to the length of the 
polymer, thus, peptides, oligopeptides, and jHioteins are included within the definition of polypeptide. 
This tCTn also does not specify or exclude post>translational modifications of polypeptides, for 
example, polypeptides which include tiie covalent attachment of glycosyl, acetyl, phosphate, amide, 
lipid, carboxyl, acyl, or carbol^rdrate groups are expressly encompassed by the term polypeptide. 
Also included within the definition are polypeptides winch contain one or more analogs of an amino 

naturaUy in an imrelafedbiolo^cal system, modified amino acids firom mammalian systems etc.), 
polypeptides with substituted linkages, as well as othra- modifications known in the art, both naturally 
occurring and non-naturally occurring. 

The term *^protein" as used herein may be used synonynoously with the term "polypeptide" or 
may refer to, in addition, a complex of two or more polypeptides which may be linked by bonds otho: 
than peptide bonds, for example, such polypeptides imking \jp the protein may be linked by disulfide 
bonds. The term "protein" may also compieh^d a family of polypeptides having identical amino 
acid sequences but different post-translational modifications, particularly as may be added when such 
proteins are expressed in eukaryotic hosts. 

An "isolated" or •'purified" protein or biologically active portion thereof is substantially free 
of cellular n^terial or other contaminating proteins ftom the cell or tissue source from which the 
protein of the invention (i.e., CPP or biologically active fragment tiiereof) is derived, or substantially 
fiw from chemical precursors or other chemicals when chemit^Uy synthesized. The language 
"substantially free of cellular material" includes preparations of a protein according to the invention in 
which the protein is separated from cellular conqjonents of the cells from which it is isolated or 
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recombinantly produced In one CTibodimmt, the language "substantiaDy free of cellular material" 
includes preparations of a protein according to the invention having less than about 30% (by dry 
weight) of protein o&e^ than tiie protein of the invention (also referred to herein as a "contaminating 
protein'O, more preferably less than about 20% of protein other than the protein according to the 
invmtion, still more prefmbly less than about 10% of protein ofhsr than the protein according to the 
invention, and most preferably less than about 5% of protein other than the protein according to the 
invention. When the protein according to the invention or biologically active p(»:tion tiiereof is 
recombinantly produced, it is also preferably substantially finee of culture medium, i.e., ciilture 
medium represents less than about 20%, more preferably less than about 10%, and most preferably 
le^ than about 5% of the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemic^s" includes 
preparations of a pxotein of the invention in which the protein is separated from chmiical precursors 
or otiier chemicals which are involved in the synthesis of the protein, in one embodiment, the 
language "substantially free of chemical precursors or other chemicals" includes preparations of a 
protein of the invention having less than about 30% (by dry weight) of chemical precursors or non- 
protein chemicals, more preferably less than about 20% chemical precursors or non-protein 
chemicals, still more preferably less than about 10% chemical precursors or non-protein chemicals. 



Hie term '"recombinant polypeptide" is used herein to refer to polypeptides that have been 
artificially designed and which comprise at least two polypeptide sequ^ices that are not foimd as 
contiguous polypeptide sequences in their initial natural environment, or to refer to polypeptides 
which have been exinressed &om a recombinant polynucleotide. 

The term ''Cardiovascular disorder Plasma Polyp^tide'' or refers to a polypeptide 

conqnising the sequraice described by any one of SEQ ID NOs: 1-28. Such polypeptide may be post- 
translationally modified as described herein. CPPs may also contain other structural or chemical 
modifications such as disulfide linkages or amino acid side chain interactions such as hydrogen and 
amide bonds that result in complex secondary or tertiary structures. CPPs also include mutant 
poljrpeptides, such as deletion, addition, swap, or truncation mutants, fusion polypeptides comprising 
such polypeptides, and polypeptide finagments of at least fliree, but preferably, and where applicable, 
8, 10, 12, 15, or 21 contiguous amino acids of the sequmce of SEQ ID NOs:l-28. Furth^ included 
are CPP proteolytic precursors and intCTnediates of the sequence selected from the group consisting 
of SEQ ID NOs: 1-28. Hie invention embodies polypeptides oicoded by the nucleic acid sequraces of 
CPP genes or CPP mRNA species, preferably himian CPP genes and mRNA species, including 
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isolated CPPs consisting of, consisting essentially of, or conqirising lite sequence of S£Q ID NOs:l- 
28. Preferred CPPs have a sequence conqirising one of the sequences of SEQ ID NOs: 1-2, 6-7, 1 1- 
12, 15-17, and 24-25. Preferred CPP fragments have a sequence c om prising one of the sequaices of 
SEQ ID Nos: 3-5, 8-10, 13-14, 18-23 and 26-28. Preferred CPPs retain at least one biological activity 
of CPPs of SEQ ID NOs:1.28. 

The term "biological activity" as used herein refers to any single function carried out by a 
CPP. These include but are not limited to: (1) indicating &at an individual has or will have a 
cardiovascular disorder, (2) circulating through the bloodstream of individuals with a cardiovascular 
disorder; (3) antigenicity, or tiie ability to bind an anti-CPP specific antibody; (4) immunogenicity, or 
flie ability to gmerate an anti-CPP specific antibod>r, and for CPP 2: (5) interacting with a CPP target 
protein, prefraably a lipase; (6) stabilizing the active site of a lipase; (7) increasing lipase activity; (8) 
interacting with a CPP target molecule such as a phospholipid, miceUe, or tri^yceride; and (9) 
forming at least one disulfide bond; for CPP 9: (5) forming intramolecular amino acid side chain 
interactions such as hydrogen, amide, or preferably disulfide links; (6) interaction with a CPP target 
molecule, preferably an RNA molecule or virion (such as respiratory syncytial virus or RSV); (7) 
antiviral activity, and (8) hydrolysis of KNA phosphodiester bonds; for CPP 21: (5) forming 
intramolecular amino acid side chain interactions such as hydrogen, amide, or preferably disulfide 

forming intramolecular amino acid side chain interactions such as hydrogen, amide, or preferably 
disulfide links; (6) interaction with a CPP target molecule, iH^eferably a bacterial endotoxin; (7) 
neutralizing bacterial endotoxins; (8) promoting mast cell chemotaxis; (9) undergoing 
posttranslational processing, for example, specific proteolysis; (10) functioning as an antimicrobial 
defense; and (1 1) inhibiting contraction of smooth muscle cells; and for CPP 20: (5) forming 
intramolecular amino acid side chain interactions such as hydrogen, amide, or preferably disulfide 
hnks; (6) interacting with a CPP target molecule, in particular, a kringle domain-containing peptides 
such as plasminog^ and (7) reducing tumor growth. 

As used herein, a '^CPP modulator'* is a molecule (e.g., polynucleotide, polyp^jtide, small 
molecule, or antibody) that is capable of modulating (i.e., increasing or decreasing) either the 
e7q>ression or the biological activity of the CPPs of the invmtion. A CPP modulator that enhances 
CPP expression or activity is described as a CPP activator or agonist. Conversely, a CPP modulator 
tiiat represses CPP ejq>ression or activity is described as a CPP inhibitor or antagonist Preferably, 
CPP modulators increase/ decrease the expression ax activity by at least 5, 10, or 20%. CPP inhibitors 
include anti-CPP antibodies, fragments thereof, antisense polynucleotides, and molecules 
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characterized by screening assays, as described herein. CPP agonists include polynucleotide 
expression vectors and molecules characterized by screening as^ys as described herein. 

A 'XIOT-related disorder" or "CPP-related disease" describes a cardiovascular disorder. 
Preferred disorders include coronaiy artery disease (CAD), coronary heart disease (CHD), peripheral 
vascular disease, cerebral ischCTua (stroke), congestive heart failure, atheroscl^sis, hypertension, 
and other cardiovascular diseases. Preferably, the likelihood that an individual will develop or 
already has such a disorder is indicated by higher than normal plasma levels of at l^st one CPP, 

Anolhra- aspect of the invention pertains to anti-CPP antibodies. The term "antibody" as used 
herein refers to immunoglobulin molecides and immunologically active portions of immunoglobulin 
molecules, i.e., molecules that contain an antigen-binding site which specifically binds (immunoreacts 
with) an antig^, such as CPP, or a biologically active firagment or homologue thereof. Preferred 
antibodies bind to a CPP exclusively and do not recognize other polypeptides with hig)a. afSnity. 
Examples of immimologically active portions of iimnunoglobulin molecules include F(ab) and F(ab% 
fragm^ts winch can be generated by treating the antibody with an enzyme such as pq>sin. The 
invention provides polycloiml and monoclonal antibodies that bind a CPP, or a biologically active 
fiagmoit or homologue thereof The t^m "monoclonal antibody" or "monoclonal antibody 
conxposition", as used herein, refers to a population of antibody molecules that contain only one 

monoclonal antibody con:q>osition thus typically displays a single binding affinity for a particular CPP 
widi which it immunoreacts. Preferred CPP antibodies are attached to a label groijp. 

As used herein, a 'label group** is any conq>ound that, when attached to a polynucleotide or 
polypeptide (including antibodies), allows for detection or purification of said polynucleotide or 
polypeptide. Label groiq)s may be detected or purified directly or indirectly by a secondary 
compound, including an antibody specific for said label group. Usefiil label groiq)s include 

radioisotopes (e.g., P, S, H, Q, fluorescent conqsounds (e.g., S-bromodesoxjoiridin, 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, 
dansyl chloride, phycoerythrin acetylaminofluorene, digoxig^iin), luminescent compounds (e.g., 
limnnol, GFP, luciferin, aequorin), en2ymes or en^one co-fector detectable labels (e.g., peroxidase, 
luciferase, alkaline phosphatase, galactosidase, or acetylcholinesterase), or confounds that are 
recognized by a secondary &ctor such as strepavidin, GST, or biotin. Preferably, a label groi^ is 
attached to a polynucleotide or polypeptide in such a way as to not interfrae with the biological 
activity of the polynucleotide or polypeptide. 

Radioisotopes may be detected by direct counting of radioemission, film exposure, or by 
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sdntillaticm counting, for example. Enzymatic labels may be detected by detennination of conversion 
of an apjiropriate substrate to product, usually causing a flucs^cent reactiozL Fluorescent and 
luminescent conqK>imds and reactions may be detected by, e.g^ radioemission, fluorescent 
microscopy, fhiorescoit activated cell sorting, or a luminometer. 

As used herein with respect to antibodies, an antibody is said to "selectively bind" or 
"specifically bind" to a target if the antibody recognizes and binds the target of interest but does not 
substantially iecogni25e and bind other molecules in a sample, e.g., a biological sanq)le, which 
includes the target of interest 

As used herein, the term "vector" refers to a nucleic acid molecule capablG of transporting 
anodier nucleic acid to which it has been linked. One type of vector is a "plasmid", which refes to a 
circular double stranded DNA loop into which additional DNA segn^ts can be ligateA Anotiier ^e 
of vector is a viral vector, wherein additional DNA segmmts can be ligated into the viral genome. 
Certain vectors are capable of autononK>us r^lication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). 
Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell 
i^on introduction into the host cell, and thereby are rephcated along with the host genome. Moreover, 
cratain vectors are capable of directing the e^cpression of gmes to which they are operatively linked. 

j^iGSBSdiasait DNA techniques are often in the form of plasmids. In the present specification, 
"plasmid" and "vector" can be used interchangeably as the plasmid is the most commonly used form 
of vector. However, the invention is intended to include such other forms of expression vectors, such 
as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated virus^), 
which serve equivalent Amotions. 

As used herein, "effective amount" describes the amount of an agent, preforably a CPP 
modiilator of the invention, suf&cient to have a desired e£fect. For example, an anticardio>^iscular 
disorder effective amount is the amount of an agent required to reduce a symptom of a cardiovascular 
disorder in an individual by at least 1, 2, 5, 10, 15, or preferably 25%. The term may also describe the 
amount of an agent required to ameh'orate a cardiovascular disorder-caused symptom in an individual. 
Common synq>toms of cardiovascular disorders include: chest pressure, heartburn, nausea, vomiting, 
numbness, shortn^s of breath, heavy cold sw^ting, un^lained fetigue, and feelings of anxiety. The 
more severe symptoms of cardiovascular disorders are chest pain (angina pectoris), rhythm 
disturbances (airhytfamias), stroke, or heart attack. The effective amount for a particular patient may 
vary depending on such &ctors as flie diagnostic method of the symptom being measured, the state of 



19 



wo 2004/090551 



PCT/EP2004/003737 



the condition being treated, the overall health of the patiait, metiiod of admirdstration, and fee 
severity of side-effects. 

CPPs cf the invention 

Hie Cardiovascular disorder Plasna Polypeptides (CPPs) of the invention are described in the 
sequence listing as SEQ ID NOs: 1-2, 6-7, 1 1-12, 15-17, and 24-25. CPPs comprising an amino acid 
sequmce selected from one of the groups consisting of SEQ ID NOs: 1-2,6-7, 11-12, 15-17,and24- 
25 are secreted and circulate at a higher level in blood plasma of individuals tihat have or are at risk of 
developing a cardiovascular disorder. 

Further included CPPs are polypeptides c oi ii pi i sin g an amino acid sequ^ce selected from one 
of the groups consisting of SEQ ID NOs: 3-5, 8-10, 13-14, 18-23 and 26-28. Such CPPs also are 
secreted and circulate in the plasma. Preferably, such CPPs also comprise additional amino acids 
from one of the groups of SEQ ID NO: 1-2, 6-7, 11-12, 15-17, and 24-25. Such additional amino 
acids are fused in frame with the selected sequence to form contiguous amino acid sequence from the 
proteins of SEQ ID NO: 1-2, 6-7, 11-12, 15-17, and 24-25. 

Interestingly, levels of flie CPPs of the invention are increased in flie plasma of individuals 
suffmng from cardiovascular disOTders. As such, the CPPs of the invention provide a useful 

presence of, a cardiovascular disorder. Further, CPPs are useful for drug design and in therapeutic 
strategies for prevention and treatment of cardiovascular disorders. 

CPP 2, of SEQ ID NO:2, represents the sequence of the protein Colipase. Colipase is a small 
protein cofactor for pancreatic triglyceride hpase (PTL), which is required for efficient dietary lipid 
hydrolysis. Mature colipase is 90 ammo acids in length, with five conserved disulfide bonds. The 
protein is synthesized as preprocolipase, with 112 amino acids, and processed to procolipase, fix>m 
which an additional 5 amino acids are cleaved at the N-terminus. Colipase is a secreted protein that 
binds to the C-terminal, non-catalytic domain of the lipase, thereby contributing to the hydrophobic 
active binding site; stabilizing activity in the presence of inhibitory substances such as bile acids; and 
directing the enzyme to the oil-water mter&ce. Five residues of colipase form polar intCTctions witii 
PTL: Arg39, Glu40, Glu59, Arg60, and Asn84. In the active confommtion, GlulO and Arg33 also 
interact with PTL. Hie lipase-colipase conq)lex is activated by micelles, which also stabilize the open 
conformation and expose the hydrophobic active site. PTL and coHpase function in the duodenum, 
releasing 50-70% of dietary fatty acids (see Crandall WV and Lowe ME, J Biol Chem (2001) 
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276:12505-12; van Tilbeurgh H, et al., Biochim Biophys Acta (1999) 1441: 173-84; Pignol D, et al., 
JBC, (2000) 275:4220^224; and Lowe ME, et al„ Amiu, Rev. Nutr (1997) 17:141-58). 

As colipase is normally present and active in the duodenum, its presence in human blood 
plasma is indicative of a biological abnormality. This is further evidenced by the absaice of the CPPs 
of the present invention in normal human plasma. Colipase has previously been observed in fee serum 
of patients with acute pancreatitis (Junge, W. and Leybold, K, Cain.ChinLActa (1982) 123(3), 293- 
302). As such, CPPs of die invention are useful for drug design and in therapeutic strategies for 
prevmtion and treatment of cardiovascular disorders, hit^stingjy in this regard, a therapeutic 
approach against obesity has been developed trough the use of a lipase inhibitor, Orlistat (Stemby, 
B, et al., Clin.Nutr. (2002) 21(5), 395-402). The mhibitor is tiiougjit to act through the reduction of 
tngiycerides intal^, and has been observed to cause steatonhoea, a condition also associated with 
defects in colipase (Gaskm, KJ. et al.. Gastroenterology (1984) 86(1), 1-7). 

Full loigth CPP 9 (SEQ ID NO: 7) forms four disulfide bonds; is glycosylated, including a 
rare C-linked glycosylation; is a n^jor eosinophil granule conq)onent, degrade RNA phosphodiester 
bonds, and has antiviral activity. Full length CPP 9 has the sequence of nonsecretory RNase 2. 
Ribonucleases catalyze ^ hydrolysis of phosphodiester bonds in RNA chains. Human ribonucleases 

(H)N), nonsecretory RNase 3 or eosinophil cationic protein (ECP), RNase 4, angiogenin, and the 
recently described RNase k6. RNases 2-4 and 6, ijv^ich belong to the RNase A family, have 
antipathogenic activities (Zhang J and Rosenberg HF, Genetics 2000; 156: 1949-58). EDN, a 
nonsewetory ribonuclease, degrades mRNA by hydrolyzing single-stranded polyribonucleotides in the 
3' to 5' direction. EDN is a major eosinophil granule protein, and as such is found in tissues affected 
by eosinophils, including pancreas, liver, lung, spleen, and body fluids. 

hici^ed serum levels of human Rnase 1 are indicative of pancreatic cancer (Femandez- 
Salas, et al., Eur. J. Biochem., 2000, 267(5): 1484-94). hitraventricular injection of EDN, in particular, 
has been iniplicated in promoting the Gordon phenom^on, a hyper-eosinophilic syndrome resulting 
in loss of Purldnje cells and cerebellar dysfunction (Newton, DX. et al., J.Neurosci., 1^4, 14(2): 
538-44). In addition, eosinophils and EDN re^rand to the respiratory disease caused by the sfngle- 
stranded RNA virus respiratory syncytial virus (RSV; femily Paramyxoviridae). Eosinophik are 
recruited to the lung parenchyma in response to RSV mfection. Once recruited, they interact with and 
are activated directly by RSV virions, resultmg in degranulation (I>oniachowske JB, et al.. Nucleic 
Acids Res 1998 Dec l;26(23):5327-32). EDN promotes the destruction of extracellular RSV virions 
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via an RNase-dependent, yet £DN-^>ecific, mechanisnL 

The CPP 21 of the invention (SEQ ID NO:12) represeit a plasma form of the Human 
Epididymal secretory protein (HE)1 . HEl, a novel human epididyn^ gene product isolated by 
differential screening, predicts an abundant, small secretozy glycopeptide. HEl is encoded by a well- 
conserved, single-copy gene (Kirchhoft et al., Biol R^rod, 1996, 54:847-56). The gene is broadly 
expressed, with highest levels in testis, kidney, and liver. HEl protein is normally present in the 
l3^sosomal compartment and makes up a major component of epididymal fluid secretions. 

The inherited neurodegenerative disease Niemann-Pick Type C2 (NPC2) results &om 
mutations in the HEl (or NPC2) gene that cause abnonz^y high cellular cholesterol accumulation 
(Naureckiene S et al.. Science (2000) 290:2298 and WO 02059369). Niemann Pick type C disease is 
associated with intracellular cholesterol and giycolipid trafficking defects. (Ong YY et al., ESqj Brain 
Res (2001) 141:218-3 1). In NPC2 patients, cholesterol is not transported through the late endosomal / 
lysosomal system and accumulates in the lysosomes, mean\iv^e depriving cells of free cholesterol. 
Consequently, the cells produce additional cholesterol to compensate for the defect, despite die 
aheady excessive chol^terol accumulation (Garver WS et al., Curr Mol Med, (2002) 2:485-505). A 
hydrophobic, sterol-binding pocket of HEl specifically interacts with cholesterol with hi^ afiBnity 

Using the n»thods of the current invention, HEl has been discov^ed in the plasma of 
Coronary Artery Disease patients. This protein is normally associated with intracellular trafBckmg 
and epididymal secretions, and dius is not expected in the plasma of control individuals. Thus, HEl is 
an important biomarker for cardiovascular disorders. 

CPP 17 (SEQ ID NO; 17) is the plasma form of the antimicrobial peptides Defensin 1-3. 
Defensins 1-3 (alpha-defensins) were cloned from a leukemia cell library, and expression was found 
in normal bone marrow and circulating lynqihocytes Q^aher, et al., PNAS. 85:7327-7331 (1988)). 
Alpha-defensins are constitutively expressed in neutrophils and can make up to 5% of cellular protein. 
The fiill-length protems are processed to form 29-35 amino acid peptides that are released in response 
to microbial and viral infection. These cationic p^tides insert and disrupt prokaiyotic membranes 
(Hill, et al. Science 251:1481-1485 (1991)). In addition, alpha-def<Misms reduce HIV replication in 
response to fee CDS T cell fector CAP (Zhang, et al. Science 298:995-1000 (2002)). 

Alpha-defensins bind low-density lipoprotein (LDL) particles in fee blood and prevait LDL 
receptor-binding and subsequent degradation (Higazi, et al. Blood 96:1393-1398 (2000) and 
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refbrences cited th^reia). It has since been discovered that a^ha-defensins interact with vesicular 
smooth muscle cells and inhibit conlraction (Na^ar, et al. Blood 100:4026-32 (2002)). 

The tnvmtors have revealed an elevated level of alpha-defensin peptides in fhe plasma of 
individuals with Coronary Artery Ehsease. These results provide direct evidence that alpha-defrasins 
are useful plasma biomark^ far the disease. 

CPP 20 (SEQ ID NO:25) is a plasma form of the Plasminogen-related Protein B. 
Plasminogen-related protein B (also Plasuniiar) was cloned from chondrocytes and is highly 
homologous to the N terminus of plasminogen (Ichinose, Biochmnstry 31:3113-8 (1992)). The 96 
amino acid-long precursor protein is secreted after removal of a 19 amino acid signal sequence. 
Plasnnnogm-related protein B binds to kringle domains and is bought to intefere with plasminogen 
binding to fibrin or alpha-2 anti-plasmin (Lewis, et al., Eur J Biochem. 259:618-^25 (1999)). 
Plasminogen-related protein B is expressed in metastatic tumor cells (WO 932l341and Weissbach 
and Treadwell, Biochem. Biophys. Res. Commun. 186:1 108-1 1 14 (1992)). In addition, Lewis, et al. 
(Anticancer Res 21 :2287-91 (200 1) demonstrated tfiat the protein reduces &e growfli of tumors 
ejqplanted in mice, while plasminogen has little effect. Thus, it is described as a treatment for 
malignant conditions as well as a treatment for angiogenesis-related disorders (WO 9946282). 

discovered in the plasma of Coronary Artery Disease patients. This protein is usually associated with 
die extracelluar matrix and is undetectable in the plasma of control individuals. Thus, Plasminogen- 
related protein B is an inqportant biomarker for cardiovascular disorders. 

The terms ^'Cardiovascular disorder Plasma Polypeptide" and "CPP** are used herein to 
embrace any and all of the peptides, polypeptides and proteins of the present invention. Also forming 
part of the invention are polypeptides encoded by tiie polynucleotides of the invention, as well as 
fusion polypeptides conQnising such polypeptides. Hie invention embodies CPPs firom humans, 
including isolated or purified CPPs consisting of, consisting essentially of, or conqnising an amino 
acid sequence selected fi-om one of the groups consisting of SEQ ID NOs: 1-S, 6-10, 1 1-14, 15-23 and 
24-28. Further included are unmodified pzecursors, proteol3rtic precursors and intermediates of the 
sequence selected fi^m one of the groups consisting of SEQ ID NOs: 1-5, 6-10, 1 1-14, 15-23 and 24- 
28. 

The present invention embodies isolated, purified, and recombii^t polypeptides con^irising 
a contiguous span of at least 3 amino acids, preferably at least 8 to 10 amino acids, with a CPP 
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biologicai activity. In preferred embodim^ts the contigaous stretch of aniino acids comprises the site 
of a nmtatioii or fimctional mutation^ including a deletion, addition, swap or truncation of the amino 
acids in the CPP sequence. The invention also ccmcrans &e polypeptide racoded by the CP? 
nucleotide sequences of &6 invention, or a complemmtaiy sequence thereof or a fragment Hxmof. 

One aspect of the invmtion pertains to isolated CPPs, and biologically active portions thereof, 
as well as polyp^tide fragments suitable for use as inununogens to raise anti-CPP antibodies. In one 
embodiment, native CPP peptides can be isolated from plasma, cells or ti^ue sources by an 
appropriate purification scheme using standard protein purification techniques, hi another 
embodiment, CPPs are produced by recombinant DNA techniques. Alternative to recombinant 
expression, a CPP can be synthesized chemically using peptide synthesis techniques, as described in 
the section titled **Chemical Manu&cture of CPP conqsositions" and in Exanq)le 2. 

Typically, biologically active portions comprise a domain or motif with at least one activity of a 
CPP. A biologically active CPP may, for example, comprise at least 1, 2, 3, or 5 amino acid changes 
fixm the sequence selected from one of the groups consisting of SEQ ID NOs: 1-5, 6-10, 1 1-14, 15-23 
and 24-28, or comfwise at least 1%, 2%, 3%, 5%, 8%, 10% or 15% change m amino acids from the 
sequence selected from one of the groins consisting of SEQ E> NOs: 1-5, 6-10, 1 1-14, 15-23 and 24- 
28. 

Characterization of CPPs 

The polypeptides of the invention, CPPs, are defined by the tryptic peptides of SEQ ID NOs: 
3-5, 8-10, 13-14, 18-23 and 26-28, listed in Table 1. These peptides were isolated from the plasma of 
Coronary Artery Disease patimts and characterized according to the MicroProtTM method, as 
desCTibed in Exanq)le 1. SEQ ID NOs: 1-2, 6-7, 11-12, 15-17, and 24-25 represent the polypeptide 
species found in CAD plasma from which the tiyptic peptides were released 

The CPPs of the invention are aU less than or around 20kD in molecular weight, as fee plasma 
sample is first s^>arated based on molecular weigjit Higher molecular weight polypeptide species are 
separated and characterized by a different metiiod. As described in Example 1, the plasma sanq}le is 
subjected to a number of chromatography separations. Details about these chromatography methods 
are given in Example 1. 

The first separation is on a cation ^change chromatography column, which is eluted wife 
nicreasmg salt concentration. Eighte^ fractions are collected. The CEX column in Table 1 lists 
which fractions ccmtained each tryptic peptide, as well as its elution conditions. Sqiaration by cation 
exchange provides an indication of the overall positive charge of a polypeptide species. Cation 
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exchange is followed by a reverse phase HPLC separation. The KPl column in Table 1 lists in which 
of the 30 fractions each tryptic p^tide eluted, as weU as its elution conditions. Separation by reverse 
phase provides an indication of the overall hydrophobicity of a polypeptide species. The last two 
digits of &e colunm labeled Run Number indicate which of the 24 eluted fractions contained the 
tiyptic peptide from the second reverse phase HPLC separation (see Exanq^le 1). Proteome indicates, 
for CPP 17, in which of the sample each tryptic peptide was observed. For CPP2, CPP 9. CPP 20 and 
CPP 21, all the observations wore made solely in the Diseased (CAD) sanq)le. Olav scores, when 
present, reflect, among other things, the strength of the e^q^erimental MS-MS signal over noise, as 
detected by the MS-MS data identification software, and dius give an indication of the protein 
ccmcentration in &e sample. 

Tablela-CPP2 



CPP# 


Sequence 1 CEX 


Salt 


RP1 


%B 


Run 


CPP 2 


SNCCQHSSALGLAR 


1 10 

i 


175 mM 


9 


35.7 


89870^13 


SNCCQHSSALGLAR 


10 


175 mM 


9 


35.7 


89870^14 


CTSMASENSECSVK 


10 


175 mM 


9 


35.7 


89870_13 


CTSMASENSECSVK 


10 


175 mM 


9 


35.7 


89870.14 


CPP# 


JT > 

Sequence 




o 

CEX 




Salt 


RP1 


%B 


Run 


CPP 9 
Cluster 1 


DPPQYPWPVHLDR 


2 


75 mM 


8 


33.7 


84158_22 


DPPQYPWPVHLDR 


2 


75 mM 


8 


33.7 


84158_22 


DPPQYPWPVHLDR 


2 


75 mM 


8 


33.7 


84158_22 


DPPQYPWPVHLDR 


2 


75 mM 


8 


33,7 


84158J22 


DPPQYPWPVHLDR 


2 


75 mM 


8 


33.7 


84158_22 


RDPPQYPWPVHLDR j 2 


75 mM 


8 


\ 33.7 


84158.22 


RDPPQYPWPVHLDR 


2 


75 mM 


8 


33.7 


84158.23 


YAQTPANMFYIVACDNR 


2 


75 mM 


8 


33.7 


84158.19 


YAQTPANMFYIVACDNR 


2 


75 mM 


8 


33.7 


84168.20 


YAQTPANMFYIVACDNR 


2 


75 mM 


8 


33.7 


84158.21 


YAQTPANMFYIVACDNR 


2 


75 mM 


8 


33.7 


84158.22 


YAQTPANMFYIVACDNR 


2 


75 mM 


8 


33.7 


84158.23 


YAQTPANMFYIVACDNR 


2 


75 mM 


8 


33.7 


84158.24 















25 



wo 2004/090551 



PCT/EP2004/003737 





] YAQTPANMFYIVACDNR ! 3 


75 mM 


id 


37.5 


o4274_1 3 


CPP9 
Cluster 2 


^ YAQTPANMPiflVAGDNR 


3 


75 mM 


11 


39.4 


84286_14 


YAQTPANMFYIVACDNR 


3 


75 mM 


11 


39.4 


o42oo_1o 


RDPPQYPWPVHLDR 


4 


75 mM 


10 


37.5 


88113_12 


YAQTPANMFYIVACDNR 


4 


75 mM 


10 


37.5 


88113__12 


YAQTPANMFYIVACDNR 


4 


K it 

75 mM 


10 


37.5 


88113_13 


YAQTPANMFYIVACDNR 


5 


75 mM 


1 1 i! 39.4 


101 105_13 










- 






* 

CPP9 

Cluster 3 | 


YAQTPANMFYIVACDNR 


8 


100 mM 


12 


1 41.3 


9571 3_07 


YAQTPANMFYIVACDN R 


9 


175 mM 


11 


39.4 


105000^10 


YAQTPANMFYIVACDNR 


9 


175 mM 


11 


39.4 


105000.13 


RDPPQYPWPVHLDR 


9 


175 mM 


12 


41.3 


106012_08 


YAQTPANMFYIVACDNR ] 9 


175 mM 


12 


41.3 


105012_08 


YAQTPANMFYIVACDNR ] 9 175 mM 

1 i 


12 


41.3 


105012_09 



FcH- GPP 9» the information provided in Table lb reveals a number of characteristics of the 
polypeptide species pr^ent in plasma of CAD patients. The following represent ^censplaiy 
intrapretations of the data contained in Table lb. Table lb shows that the same tr3rptic peptide is 

inoicates mat me tt^rpac pepaoe is feieasea sfom eaii^nent circmaang poiypepnde species, t or 
exanaple» CFP 9 was obs^i^ in a continuous set of CEX fractions from 2 to 5, and then again in 
another continuous set of CEX fractions from 8 to 9. Hiis indicates the existence of at least two 
polypeptide species from which the tryptic pq>tides were derived, which possess difTerent positive 
charge propoiies. This may indicate the presence of additional amino acids or a charged modification 
(e.g., phosphate group) in one ^cies and not the other. Similarly, CPP 9 was observed in RPl 
fraction 8, and &qi in a continuous set of RPl fractions from 10 to 12. Once again, this indicates the 
existence of at least two polypeptides species from which the tryptic peptides were derived, which 
possess different hydrophobic indexes. Taken together, these observations indicate that the observed 
peptide &11 into three non-sequence>specific elution clusters: the first elution cluster includes 
fragments eluted in C£X fraction 2 and RPl fraction 8; the second cluster includes fragments eluting 
in CEX fractions 3-5 and RPl fractions 10-1 1; and the third includes fragments eluting in CEX 
fractions 8-9 and RPl fractions 1 1-12. Based on tiie combined fraction information for the tryptic 
pqitid^ of SEQ ID NOs:3-5, it is likefy that three different forms or species of CPP 9 circulate in 
plasma of individuals with CAD. These CPP 9 forms likely differ in post-translational modification 
and/or amino acid length. The tiyptic peptides of SEQ ID NOs:8-10 are derived fix>m the polypeptide 
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of SEQ ID NO: 7, which is processed to form the different polypeptide species presoit in tiie plasma 
of individuals witii CAD. 

Table Ic-CPP 21 

5 



CPP# 


Peptide Sequence 


1 CEX 


Salt 


RP1 


%B 


1 Run Number 


CPP 21 

u 

1 


AWHGILMGVPVPFPIPEPDGCK 


' 3 


76 


13 


43.27 


8431 8_1 4 


AWHGfLMGVPVPFPIPEPDGCK 


3 


75 


13 


43^7^ 8431 8_1 5 


A\A/HGILMGVPVPFPIPEPDGCK 


3 


75 


13 


43.27 


84318^1 5 


AWHGILMGVPVPFPIPEPDGCK 


2 


76 


10 


37.5 


84178_22 


AWHGILMGVPVPFPIPEPDGCK 


3 


75 


13 


43.27 


8431 8_1 5 


SGINCPIQK 


3 


75 


13 


43.27 


84318^15 


jSGINCPiQK 


2 


75 

1 


10 i 37.5 

1 1 


84178_22 



Table Id- CPP 17 



Table Id -CPP 17 















1 o.u 


YGTCIYQGR 


6 


100 




33.65 


130804^09 


Control 


12.59 


YGTCIYQGR 


6 


100 


! 8 


33.65 


130804_10 


Control 


20.62 


YGTCIYQGR 


6 


100 


9 


35.7 


130820_10 


Control 


30.84 


IPACIAGER 


7 


\ 100 


7 


31.73 


130782_15 


Control 


23.41 


IPACIAGER 


' 


100 


7 


31.73 


130782^13 


Control 


16 


IPACIAGER 3 7 


100 


7 


31.73 


130782^13 


Control 


12.8 


IPACIAGER 


7 


100 




31 .73 


130782^15 


Control 


11.73 


IPACIAGER 


7 


100 


7 


31.73 


130782_14 


Control 


23.61 


YGTCIYQGR 




1(K) 


7 


31.73 


130782_13 


Control 


44.04 


YGTCIYQGR 

L— 


7 


100 


7 


31.73 


130782^13 


Control 


52.39 


YGTCIYQGR 


7 


100 


7 


31.73 


130782^14 


Control 


49.92 


YGTCIYQGR 


7 


100 


7 


31.73 


130782_14 


Control 


47.62 


YGTCIYQGR 


7 


100 


7 


31.73 


':30782_14 


Control 


36.78 


YGTCIYQGR 


7 


100 


7 


31.73 


130782 J 2 


Control 


42.07 


YGTCIYQGR 


7 ' 100 




7 


31.73 


130782.15 1 


Control 


43.83 


IPACIAGB^ 


/ 


100 [ 


8 


33.65 


130808 10 . Control 

- - « 


27.5 
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Table 1d-CPP17 


Peptide Sequence 


CEX 


Salt 


RP1 


%B 


Run 


Proteome 


Olav Score 


IPACIAGER 


7 


1 100 


8 


33.65 


130808.08 


Control 


25^ 


IPACIAGER 


7 


100 


8 


33.65 


130808_11 


Control 


24.8 


IPACIAGER 


7 


j 100 


8 


33.65 


130808.09 


Control 


23.61 


YGTCIYQGR 


7 


100 


8 ^ 33.65 


130808.08 


Control 


42.42 


IPACIAGER 


7 


100 


10 


37.5 


130836_07 


Control 


29.46 


YGTCIYQGR 


7 


100 


10 


37.5 


130836.07 


Control 


48.61 


IPACIAGER 


7 


100 


12 


41.34 


130868.12 


Control 


15.48 


IPACIAGERR 


8 


100 


8 


33.65 


121059 08 


Control 


24.97 


IPACIAGER 


8 


100 

J 


8 


33.65 


121059.11 


Control 


21.05 


RYGTCIYQGR 


8 


100 


8 ^ 33.65 


121059.08 


Control 


46.79 


RYGTCIYQGR 


8 


100 


8 


33.65 


121059^08 


Control 


35.9 


YGTCIYQGR 


8 


100 


8 


33.65 


121059.11 


Control 


28.98 


IPACIAGER 


8 


100 


9 


35.7 


121071.08 


Control 


5 

12.3 


IPACIAGER 




100 


9 


35.7 i 


— — — — — ^— ^— 
! 121071^08 


Control 


28.34 


IPACIAGER 


8 


, 

100 


9 


35.7 


121071.11 


Control 


28.63 


IPACIAGER 


8 


100 J 9 


35.7 


121071.07 


Control 


12.19 


IPACIAGER 


8 


lOOl 


1 9 


35.7 


121071.09 


Control 


33.23 


YGTCIYQGR 


8 


100 


9 


35.7 


121071.10 


Control 


34.47 


! YGTCIYQGR 


8 


100 


9 


35.7 


121071_08 


Control 


41.24 


YGTCIYQGR 


8 


100 


9 


35.7 


121071.11 


Control 


52.02 


YGTCIYQGR 


8 


100 


9 1 


35.7 


121071 09 1 


Control 


53.73 


YGTCIYQGR 


8 1 


100 


-9^ 


35.7 


121071^07 j Control 


35.05 


YGTCIYQGR 


8 


100 


9 


35.7 


121071^12 


Control 


25.16 


LWAFCC 


8 


100 


9 


35.7 


121071.09 


Control 


17.74 


IPACIAGER 


8 


100 


10 


37.5 


121091_08 


Control 


25.38 


YGTCIYQGR 


8 


100 


10 


37.5 


'121091.07 


Control 


42.97 


YGTCIYQGR 


8 


100 


10 


37.5 


121091.08 


Control 


45.46 


YGTCIYQGR 


8 


100 


10 


37.5 


121091_09 


Control 


47.15 


IPACIAGER 


8 


1(K) 


11 


39.42 


121095.07 


Control 


26.69 


IPACIAGER 8 


•:oo 


11 


39.42 


121095.08 


Control ' 


32.28 


IPACIAGER 


8 


100 


11 i 39.42 


121095^09 


Control j 


27.02 


YGTCIYQGR 


8 


100 


11 ^ 39.42 


121095.09 


Control 


42.21 


YGTCIYQGR 


8 


100 j 11 


39.42 1 


121095.14 J Control 


16.66 


YGTCIYQGR 


8 


100] 


11 


39.42 1 121095_09 [ Control 


36.49 



28 



wo 2004/090551 PCT/EP2004/003737 



Table Id -CPP 17 



Peptide Sequenpe 


CEX 


Salt 


RP1 


%B 1 Run 


1 Proteome 


Olav Score 


YGTCIYQGR 


8 


100 


11 


39.42 


: 121095J07 


Control 


42.61 


YGTCIYQGR 


8 


100 


• * 

11 


39.42 


121095_08 


Control 


36.09 


LWAFCC 


8 


^ ^^^^ 

100 


11 


39.42 


121095^08 


Control 


17.02 


LWAFCC 


8 


100 


11 


39.42 


121095^07 


Control 


20.09 


IPACIAGER 


8 


100 


12 


41.34 


121115.12 


Control 15.91 

1 


IPACIAGER 


8 


100 


12 


41.34 


121115_05 


Control 


29.01 


IPACIAGER 


8 


100 


12 


41.34 


121115_07 


Control 


28.02 


IPACIAGER 


8 


100 


12 


41.34 


121115J06 


Control 


30.2 


YGTCIYQGR 


8 


100 


12 


41.34 


121115_09 


Control 


27 


YGTCIYQGR 


8 


100 


12 


41.34 


121116_07 


Control 


48.94 


YGTCIYQGR 


8 


100 


12 


41.34 


121115_06 


Control 


37.44 


YGTCIYQGR 


8 


100 


12 


41.34 


121115^05 


Control 


57.97 


IPACIAGER 


8 


100 


13 


43.27 j 121127^08 


1 Control 


30.44 


IPACIAGER 


8 


100 13 


43.27 j 121127^05 


Control 


29.38 


IPACIAGER 


8 


100 


13 


43.27 , 121127^07 


Control 


24.45 


YGTCIYQGR 


8 


100 


13 


43.27 


121127_06 


Control 


48.37 


YGTCIYQGR 




100 


13 


43,27 


121127_05 


Control 


50.08 

1 


IPACIAGER 


9 


175 


7 


31.73 


130774^13 


Control 1 


25.57 


IPACIAGER 


9 


175 


7 


31.73 


130774.11 


Control 


25.79 


IPACIAGER 


9 


175 7 


31.73 


130774.13 


Control 


28.54 


IPACIAGER 


9 


175 


7 


31.73 


130774.12 


Control 


11.1 


IPACIAGER 


9 176 


7 


31.73 


130774.12 


Control 


31.51 


RYGTCIYQGR 


9 


175 


7 


31.73 


130774.13 


Contrd 


18.08 


YGTCIYQGR | 


9 


175 


7 


31.73 


130774^13 


Control 


48.73 


YGTCIYQGR 


9 


175 


7 


31.73 


130774.13 


Control 


48.83 


To 1 OlYVJoK 


9 


175 


7 


31 .73 


130774.11 


Control 


42.52 


YGTCIYQGR 


9 


175 


7 


31.73 J 130774_12 


Control 


61.96 


YGTCIYQGR 


9 


175 j 7 


31.73 [ 130774^14 


Control 


60^5 


YGTCIYQGR 


9 


175 


7 


31.73 130774_12 j Control 


44.13 


YGTCIYQGR 


9 


175 


7 


31.73 


130774.10 


Control 


14.34 


LWAFCC 


9 


175 


7 


31.73 


130774.12 


Control 


18.2 


IPACIAGER 


9 


175 


8 


33.65 


130786.11 


Control 


24.81 


IPACIAGER 


9 


175 


8 


33.65 


130786.08 


Control 


21.49 


YGTCIYQGR 

< 


9 1 175 


8 


33.65 


130786.08 


Control 


19.16 
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Peptide Sequence 


CEX 


salt 


RP1 


' %B 


Run 


Profeome 


Olav Score 


IPACIAGER 


9 


175 


12 


41.34 


130860.10 


Control 


31.05 


IPACIAGER 


9 


175 


12 


41.34 


130860_11 


i Control 


29.74 


YGTCIYQGR 


9 


175 


12 


41.34 


130860_10 Control 


«}.91 


YGTCIYQGR 


9 


175 


12 


41.34 


130860.11 Control 40.12 


IPACIAGER 9 

\ 


175 


13 


43.27 


130876.10 


Control 14.94 


IPACIAGER 9 


175 


13 


43.27 


130876_12 


Control 


33.64 


YGTCIYQGR 


9 


175 


13 


43.27 


130876.12 


Control 


42.65 


1 YGTCIYQGR 


9 


175 


13 43.27 


130876.10 


Control 


25.55 


YGTCIYQGR 


9 


176 


13 


1 43.27 


130876.12 


Control 


26.9 


IPACIAGER 


10 


175 


8 


33.65 1 110332„08 


Control 


25.99 


IPACIAGER 


10 


175 


8 


33.65 ; 110332_11 


Control 


34.38 


IPACIAGER 


10 


175 > 8 


33.65 


110332.12 


Control 


13.51 


IPACIAGER 


10 


175 


8 


33.66 


110332.09 


Control j 13.77 


YGTCIYQGR 


10 


175 


8 


33.65 


110332_10 


Control 


23.53 


YGTCIYQGR 


10 


175 


8 


^ 33.65 


110332.11 


Control 


43.26 


YGTCIYQGR 


10 


176 


8 1 33.65 


1 1 0332.08 


Control 


20.07 


YGTCIYQGR 


10 


175 


9 

- 


! 35.7 

- 


110348.09 


Control 


47.8 


YGTCIYQGR 


11 


175 ! 8 


33.65 : 110336.12 


Control 


33.38 


YGTCIYQGR 


11 


175 


8 


33.65 ' 110336 11 


Control 


45.59 


YGTCIYQGR 


11 


175 


8 


33.65 


110336.12 


Control 54.61 

•1 


IPACIAGER 


11 


175 


9 


35.7 


110344.10 


Control 


21.1 


IPACIAGER 


11 


175 


' 


35.7 


110344_09 


Control 


32.42 


YGTCIYQGR 


11 


175 


9 


35.7 


110344.09 


Control 


46.58 


IPACIAGER 


12 


175 


8 


33.65 


121083.08 


Control 


33.79 


YGTCIYQGR 


12 


175 


8 


33.65 


121083_08 


Contro) 


49.15 


YGTCIYQGR 


13 


175 


9 


35.7 


110352.C» 


Control 


27.99 i 


IPACIAGER 


14 


175 


7 


31.73 1 

>> 


117862.13 


Control 


33.6 i 


YGTCIYQGR 


14 


175 


7 


31.73 117862^13 


Control 


28.57 


IPACIAGER 


4 


75, 8 


33.65 


88177_09 CAD 


15.34 


IPACIAGER 


5 


76 


9 


35.7 


101089^16 


CAD 


14.38 


IPACIAGER 


5 


75 


9 


35.7 


101089_11 


CAD 


18.62 


YGTCIYQGR 


5 


75 


9 


35.7 


101089^12 


CAD 


39.54 


IPACIAGER 


6 


100 


8 


33.65 


111981.09 


CAD 


26.89 


IPACIAGER 


6 


100 


9 


35.7 


100782_09 


CAD 


14.96 
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Table Id -CPP 17 



PeptidB SequGnce 


CEX 


Salt 






Run 


ProtBome 


Olav Score 


YGTCrVQGR 


6 


100 


g 


35 7 

WW* / 


100782 10 

1 WW r 1 w 


CAD 


12.18 


YGTCIYQGR 


6 




Q 


35.7 


100782J1 


CAD 


36.7 


IPACtAGER 


7 

V 


100 


a 


33.65 


108099_11 


CAD 


26.83 


IPACtAGER 


7 


100 


ft 


33.65 


108099^09 


CAD 


22.09 


iPACIAGER 


7 


100 




33.65 


108099.12 


CAD 


13.52 


IPACIAGER 


7 


100 


8 


33.66 


108099.10 


CAD 


25.05 


RYGTCI YOG R 


7 


inn 


8 


33.65 


108099.09 


CAD 29.38 


YGTCIYOGR 


7 


inn 


8 


33.65 


108099.08 


CAD 


49.46 


YGTCIYOGR 


7 


100 


8 


1 33.65 


108099.12 


CAD 

IT 


33.68 


YGTCIYOGR 


7 


100 


8 


33.65 { 108099.09 1 CAD 


46.69 


YGTCIYOGR 


7 
• 


100 


8 


33.65 


108099.11 


CAD 


33.65 


IPACIAGER 


7 


100 


9 


35.7 


108103.11 


CfiD 


18.78 


IPACIAGER 


7 


100 


9 


35.7 


1(^103.08 


CAD 


26.41 


iPACIAGER 


7 


100 


9 1 35.7 


108103.10 


CAD 


28.48 


IPACIAGER 


7 


100 


9 


35.7 


108103.09 


CAD 


29.11 


YGTCIYOGR 


7 


100 


9 


35.7 1 108103J39 


CAD 


47.95 


YGTCIYOGR 


7 


100 




35,7 j 108103_11 


CAD 


35.85 


YGTCfYOGR 


7 


inn 


9 


35.7 


108103.10 


CAD 


• 51.77 


iPACIAGER 


7 


inn 


10 


37.5 


105004.07 


CAD 


15.75 


YGTCIYOGR 


7 

f 


inn 


10 


37.5 


105004.08 


CAD 


43.36 


YGTCIYOGR 


7 
f 


inn 


10 


37.5 


105004.07 


CAD 


44.2 


IPACIAGER 


7 


inn 


11 


39.42 


104992.08 


CAD 


31.77 


YGTCIYOGR 


7 


inn 


11 


39.42 


104992.07 


CAD 


33.44 


YGTCIYOGR 


7 


inn 

lUU 


11 


39.42 


104992.08 


CAD 


28.08 


IPACIAGER 


7 


inn 


12 


41.34 


105020.07 


CAD 


16.66 


YGTCIYOGR 


7 . 
i 


100 


12 


41.34 


105020.07 


CAD 


13.61 


IPACIAGER 


8 


100 


8 


33.65 


98379.16 


CAD 


15.58 


IPACIAGER 


8 


100 


8 


33.65 


98379.1 1 1 


CAD 


25.88 


IPACIAGER 


8 


100 


8 


33.65 


98379_08 


CAD 


24.81 


IPACIAGER 


8 


100 


8 


33.65 


98379.12 


CAD 


26.89 


RYGTCIYQGR 8 

i 


100 


8 


33.65 


98379.09 


CAD 


38.01 


RYGTCIYQGR ! 


8 


100 


8 


33.65 


98379.09 


CAD 


35.93 


YGTCIYOGR 


8 


100 




33.65 


98379.13 


CAD 


32.74 


YGTCIYQGRLWAFCC 


1 


100 


8 


33.65 


98379.08 


CAD 


49.34 
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Table Id -CPP 17 





wBA 


Salt 1 RP1 


%B 


Run 


Proteome 


MiaV oCOrB 


T V? 1 T won. 


Q 
O 


100 


8 


33.65 


98379_12 


CAD 


10.42 




Q 

o 


100 


8 


33.65 


98379_09 


CAD 


60.78 




D 


100 


8 


33.65 


98379^08 


CAD 


47.12 


Yf5TP fvnft R 


o 
o 


100 


8 


33.65 


98379 08 1 CAD 


46.31 


VftTf! IVfift R 

T V3 1 Vrf 1 T w wf^ 


Q 

o 


100 


8 


33.65 


98379_10 


CAD 


16.91 


Yf5TP IVr»f5 R 


Q 
O 


100 


8 


33.65 


98379_10 


CAD 


36.03 




Q 
O 


100 


8 


33.65 


98379_1 1 


CAD 


A f\ eft 

40.69 


1 Y^^TPIYOfSRI WAPOr* 


Q 
O 


100 


1 8 * 33.65 


98379_08 


CAD 


41.85 


YGTCIYQGR 


8 


100 


8 


33.65 


98379_07 


CAD 


38.06 


Yr^TPiYriOR 


Q 
O 


100 


8 


33.65 


98379_12 


CAD 


52.75 


LVVMr 


t ft 
1 


100 


8 


33.65 


98379_11 


CAD 


17.02 




a 
O 


100 


8 


33.65 


98379^08 


CAD 


20.09 




o 
O 


100 


8 


33.65 


98379.09 


CAD 


20.09 




o 
O 


100 


8 


33.65 


98379_12 


CAD 


21.28 


IPAPIAAPR 


a 
O 


100 


9 


35.7 


98395_16 


CAD 


29.25 




Q 

O 


100 


9 


35.7 


98395.12 


CAD 1: 21 .9 






100 


9 


35.7 


98395.12 


CAD I 17,76 


IPACIAGER 


8 


100 


9 


35.7 


98395.12 


CAD 


20.93 ; 


IPACIAGER 


8 


100 


9 


35.7 


98395.17 


CAD 


28.01 


IPACIAGER 


8 


100 


9 


35.7 


98395.09 


CAD 


23.18 


IPACIAGER 


8 


100 


9 


35.7 


98395.09 


CAD 


30.94 


IPACIAGER 


8 


100 


9 


35.7 


98395.08 


CAD 


20.52 


IPACIAGER 


8 


100 


9 


35.7 98395^13 


CAD 


25.13 J 


IPACIAGER 


8 


100 




35,7 1 


98395.12 


CAD ' 


30.76 i 


IPACIAGERR 8 


100 


9 


35.7 


98395.11 


CAD 


26.5 


IPACIAGER 1 


8 


100 


9 


35.7 


98395.08 


CAD 


20.09 


IPACIAGER 


8 


100 


9 


35.7 


98395.09 


CAD 


16,16 


IPACIAGER 


8 


100 


9 


35.7 


98395.14 


CAD 


26.79 


RYGTCIYQGR 


8 


100 


9 


35.7 


98395.14 


CAD 


29.34 


RYGTCIYQGR 


8 


100 


9 


35.7 


98395.1 1 


CAD 


27.02 


RYGTCIYQGR 


8 


100 


9 


35.7 


98395.08 CAD 


29.27 


RYGTCIYQGR 


-8—* 


100 


9 


35.7 


98395.10 CAD 


33.39 


RYGTCIYQGR 8 


100 


9 


35.7 98395.12 [ 


CAD 


25.61 


YGTCIYQGR 8 


100 


' 1 


35.7 98395_14 


CAD 


41 .77 
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Table Id -CPP 17 



Peptide Sequence 


CEX 


Satt 


RP1 % B 


Run 


Proteome 


Olav Score 


YGTCIYQGRLWAFCC 


8 


100 


9 


35.7 


98395^08 


CAD 


48 82 


YGTCIYQGR 


8 


100 


9 


35.7 


98395 J 2 


CAD 




YGTCIYQGRLWAFCC 


8 


100 


9 


35.7 


98395.08 


CAD 




YGTCrVOGR 


8 


100 

• ^^^^ 


1 9 

. 


35.7 


98395 J 1 


CAD 




YGTCIYQGR 


8 


100 


9 


35.7 


98395.18 


CAD 


9i Oft 


YGTCrYQGR 


8 


100 


9 


35.7 


98395.13 


CAD 


OO.uO 


YGTCIYQGR 


r-8— 


100 


9 


35.7 


98395_09 


CAD 




YGTCIYQGR 


8 


100 


9 » 35.7 


98395_17 CAD 


00.09 


YGTCIYQGR 


8 


100 


y 


"7 


9o395J0o 


CAD 




YGTCIYQGR 


8 


100 


9 


35.7 


9o395_08 


CAD 


43.61 


YGTCIYQGRLWAFCC 


8 


100 


1 n 

9 


35.7 


98395.12 


CAD 


57.23 


LWAFCC 


8 


100 


9 


35.7 


98395.09 


CAD 


21.28 


LWAFCC t 8 


100 


9 


35.7 


98395.1 2 


CAD 


17.04 


: LWAFCC 


8 


i 100 


9 


35.7 


98395.10 


CAD 


20.09 


IPACIAGER 


8 


100 


10 


37.5 


95685.1 1 


CAD 


27.27 


IPACIAGER 


8 


100 


10 


37.5 


95685_08 


CAD 


24.75 


IPACIAGER 


8 


100 


10 


37.5 


95685.10 


CAD 


16.84 












IPACIAGER 1 




100, 


10 


37.5 


95685_14 


CAD 


24.62 


IPACIAGER 1 


8 


100 


10 


37.5 


95685.09 


CAD 


19.98 


IPACIAGER 


8 ' 


100 


10 


37.5 


95685.13 


CAD 


10.65 


IPACIAGER 




100 


10 


37,5 


95685.06 


CAD 


31.36 


RYGTCIYQGR 


8 


100 


10 


37.5 


95685.06 1 


CAD 


31.07 


RYGTCIYQGR 


8 


100 


10 


37.5 


95685_07 


CAD 


38.34 


YGTCIYQGR 


8 


100 


10 


37.5 


95685_16 


CAD 


20.45 


YGTCIYQGR 


8 


100 


10 


37.5 


95685_14 


CAD 


32.12 


YGTCIYQGR 


8 


100 


10 


37.5 


95685.13 


CAD 


23.06 


YGTCIYQGR 


8 


100 


10 


37.5 


95685.07 


CAD 


38.95 


YGTCIYQGR 8 


100 


10 


37.5 


95686.11 


CAD 


42.46 


YGTCIYQGR 


8 


100 


10 


37.5 


95685.15 


CAD 


19.41 


YGTCIYQGR 


8 


100 


10 


37.5 


95685.06 


CAD 


45.03 


LWAFCC 


8 


100 


10 


37.5 


95685.06 


CAD 


17.74 ; 


LWAFCC 


8 


100 


10 


37.5 


95685.09 


CAD 


15.84 


LWAFCC 


8 


100 


10 


37.5 


95685_08 


CAD 


20.09 


LWAFCC 


8 


100 


10 


37.5 


95685.06 


CAD 


13.49 
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PeDtidfi Seauence 


CEX 


Salt ^ RP1 

i 


%B 


Run 


1 Proteome 


Olav ScoriB 


LWAFCC 


8 


100 


.10 37.5 


95685_07 t CAD 


17 74 


IPACIAGER 


8 


100 


11 


39.42 


1Qu94o_u9 




25 95 


IPACIAGER 


A 


100 


11 


39.42 


10u34o_07 


CAD 


10 92 


IPACIAGER 


8 


100 


11 


39.42 


•4 r\f\f\A O A A 


^ A 
OAU 


1^ 


IPACIAGER 


8 


100 


11 


39.42 


10u94o_09 


CAD 




IPACIAGER 


8 


100 


11 


39.42 


10094o_05 


CAD 


1 


IPACIAGER 


8 


100 


11 


39.42 


100948^08 


CAD 


^/ .1/9 


IPACIAGER 


8 


100 


11 


39.42 


100948_06 


CAD 


97 9R 


IPACIAGER 


8 


100 ' 11 


39.42 


Annf\AO 

1U0948_12 


CAD 


23.02 


IPACIAGER 


8 


100 


1 1 39.42 


1 00948 JD7 


j^ A r% 

CAD 


17 ^57 
1 / .Of 


IPACIAGER 


8 




100 


11 


39.42 


100948_PD 


CAD 


9R fl7 


IPACIAGER 


8 


100 


11 


39.42 


100948_07 


CAD 


97 19 

1 


RYGTCIYQGR 


8 


100 


11 


39.42 


Anf\f\Ar% f\ty 

100948_06 


CAD 




YGTCIYQGR 


8 


100 


11 


39.42 


1 00948__08 


CAD 


"XA ni 


YGTCIYQGR 


8 


100 


11 


39.42 


A A n nn 

100948_09 


CAD 


9Q f\R 


YGTCIYQGR 


8 


100 11 

r 


39.42 


1 00948J05 


CAD 




YGTCIYQGR 


8 


100 j 11 


39.42 


1 00948_06 


CAD 




YGTCIYQGR 


8 


100 


11 


39.42 


100948JD6 


CAD 


^fi Oi i 
'rO.^ 1 j 


YGTCIYQGR ! 8 


100 


11 


39.42 


A AAA A n A A 

1{M^48_06 


CAD 




LWAFCC 


8 


100 


11 


39.42 


.a #%^VA A A A A 

100948 JOB 


CAD 




LWAFCC 


8 


100 


11 


39.42 


100948j07 


CAD 


01 Oil 


LWAFCC 


8 


100 


11 


39,42 


100948_07 


CAD 


on no 

^U.U9 


IPACIAGER 


^^ 


100 


12 


41.34 


9571 3_04 


CAD 


Q 7<« 


IPACIAGER 




100 


12 


41.34 


95713^04 


CAD 




IPACIAGER 


8 


100 




41.34 


95713^06 


CAD 




IPACIAGER 


8 


100 


12 


41.34 


9571 3_04 


CAD 


20.55 


IPACIAGER 


8 


100 


12 


41.34 


9571 3_04 


CAD 


29.89 


IPACIAGER 


8 


100 


12 


41.34 


95713.09 


CAD 


28.65 


IPACIAGER 


8 


100 


12 


41.34 


9571 3_08 


CAD 


13.7 


YGTCIYQGR 


8 


100 


12 


41.34 


9571 3_08 


CAD 


12.11 


YGTCIYQGR 


8 


100 


12 


41.34 


9571 3_04 


CAD 


60.22 


YGTCIYQGR 


8 


100 


12 


41.34 


9571 3_06 


CAD 


55.12 


IPACIAGER 


8 


100 


13 


43.27 95721^04 


CAD 


28.38 


IPACIAGER 


8 " 100 

k 


13 


43.27 1 95721_05 


CAD i 28.84 

1 Lf 
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Peptide Sequence | CEX 


Salt 


RP1 


% B 


Run 


Proteome 


Olav Score 


IPACJAGER 


8 


100 


13 


43^7 


95721_06 


CAD 


35 38 


To 1 OlYUoK 


Q 

D 


100 


13 


43.27 


95721^04 


CAD 


58.93 




o 

O 


100 1 13 


43.27 


95721^05 


CAD 




ID AOl A^CD 

IrAoiAotK 


Q 

CS 


100 


14 


45.2 


957^.02 


CAD 




ir'AUIAofcK 


Q 

cS 


100 


14 


45.2 


95729_04 


CAD 


£ 1 .OH 


ID AOI A/*^CD 


O 


100 


14 


45.2 


95729_03 


CAD 




To 1 OlYUoK 


CI 


100 


14 


45.2 


95729_02 


CAD 


A^ CV7 

*r 1 .\j f 


YGTCIYQGR 


8 


j 100 


14 


45.2 


95729_04 


CAD 




1 VA/ACOr* 


8 


100 


14 


45^ j 95729_03 


CAD 


9.71 


ML> t V MMMn t W I/VMJ 1 
C V V VOL^^VV wCOL^V^IV 


8 


100 


14 


45J2 


95729_06 


CAD 


21 57 




8 


100 


16 


ATA 


100961J)3 


CAD 


14 11 




8 


100 


15 


47.1 


100961_03 


CAD 


37 92 


ii r>w ir\\9 cur% 


9 


175 


8 


33.65 


108107^11 


CAD 


22 35 


1 1 A\w lr%SS CT\ 


9 


175 


8 


33.65 


108107^07 


CAD 


25 09 1 




9 


175 


8 j 33.65 


108107_08 


CAD 


27 93 


IPAniAAPR 


9 . 175 


8 . 33.65 


108107_12 


CAD 


19 97 


IPACiAGER 


9 


175 


8 


33.65 j 108107_09 


CAD 




RYGTCIYQGR 


9 


175 


8 


33.65 


108107_09 


CAD 


*f 1 .90 


RYGTCfYQGR 


9 


175 


8 


33.65 


108107J1 CAD 




RYGTCrv'QGR 


9 


175 


8 


33.65 


1081 07^08 [ CAD 


A A OQ 


RYGTCIYQGR 


9 


176 


8 


33.65 


108107^11 


CAD 


25.03 


iPAClAGER 


9 j 


175 


9 . 

i 


35.7 


108111_13 


CAD 


27.29 


IPACIAGER 


9 ; 175 


9 


35.7 


108111_20 


CAD 


21,68 


IPAClAGER 


9 


176 


9 


35.7 


108111_18 


CAD 


13.34 


IPACIAGER , 9 


175 


9 


35.7 


108111_14 


CAD 


29.32 


IPAClAGER 9 


175 


9 . 35.7 


108111_11 


CAD 


26.21 


IPACIAGER 9 


175 


9 


35.7 


108111_09 


CAD 


28.44 


IPACIAGER 


9 


175 


9 


35.7 


108111^17 


CAD 

.1 


25.43 


IPACIAGERR 


9 


175 


9 


35.7 


108111^08 


CAD 26.44 


IPACIAGER 


9 


175 


9 


35.7 


108111^15 


CAD 


26.58 


IPACIAGER 


9 


175 1 9 


35.7 


108111_10 


CAD 


28.97 1 


IPACIAGER 


9 


175 


9 


36.7 ii 108111_08 


CAD 


25.32 : 


IPACIAGER 


9 ! 

■1 


175 


9 


35.7 S 108111 19 i CAD 

1 


18.61 
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Table Id -CPP 17 



1 Peptide Sequence 


CEX 


Salt 


RP1 


% B 

m IMF 


Run 


Protfiome 


Olav Score 


RYGTCIYQGR 


9 


176 


9 


35 7 


108111 09 


CAD 


46 86 


RYGTCIYQGR 


9 


175 


9 


35 7 


108111 09 


CAD 


36 76 


RYGTCrV^QGR ' 9 


175 ' 9 


35 7 


108111 08 


CAD 


26 49 

&U.*T9 






175 9 


35 7 


108111 13 


CAD 




YGTCIYQGR 


9 


175 


9 


35 7 


108111 14 


CAD 








1 fO 


9 


35.7 


108111_11 


CAD 


17 QR 
1 r .90 




A 

si 


I/O 


9 


35.7 


108111_09 


CAD 


1A R7 




n 
9 


175 


9 


35.7 


108111_09 


CAD 


17 7fi 






I/O 


9 


35.7 

« 


108111J)8 


CAD 


16.32 


1 \Af APr*r* 

UvVMr 


k O 

* y 


1 f O 


9 

So- 


35.7 


108111_11 


CAD 


1A ft7 




Q 
9 


175 




37.6 


104996_08 


CAD 




YGTCIYQGR 


9 


175 


10 


37.5 


104996_07 


CAD 


Af^ n7 


To i W*lYVilV9K 


n 
9 


175 


10 


37.5 


104996^08 


CAD 


'^l 1^ 




9 


175 


10 


37.5 104996_08 


CAD 


A6 




r\ 
9 


175 


10 


37.6 1 104996 J)7 


CAD 


1^ AO 


t 


Q 

9 


175 


10 


37.5 


. 104996^07 


CAD 


15.84 


IDAr^lA/T&ETD 


n 
9 


175 




39.42 


105000_08 


CAD 


7Q 

* 




Q 

9 


175 


11 


39.42 


105000_08 




Ol .01 


1 Airr^o 

LVVMrVi^O 


9 


176 


11 


39.42 


106000_08 


PAD 


i R Ol 




9 


176 


12 


41.34 1 06012^07 






iPAuiAGcK 


9 


1 f o 


12 1 


41 .34 


105012_07 


oAn 




YGTCIYQGR 


Q 

9 


176 


12 


41.34 


105012_06 






ADEVAAAPEQIAADJP 
EWVSLAWDESLAPK 


9 


175 


12 


41.34 


105012_08 


PAn 


7Q •! 
/O.I 


iPACIAGER 


9 


176 


14 


46.2 


105036^08 






IPACIAGER 


9 


176 


15 


47.1 


1 05028 J)9 


CAD 


31 .M 


IPACIAGER 


9 


175 


16 


47.1 


105028_08 


CAD 


30.44 


IPACIAGER 


10 


176 


7 


31.73 


89838_16 


CAD 


12.17 


IPACIAGER 


10 


176 


7 1 31.73 


89838_18 


CAD 


40.44 


IPACIAGER 


10 


175 


7 1 


31.73 


89a38_21 


CAD 


26.98 


1 YGTCIYQGR 


10 


176 


7 


31.73 


89838.21 


CAD 


13.57 


YGTCIYQGR 


10 


176 


7 


31.73 


89838_16 


CAD 


21.43 


LWAFCC 


10 


175 


7 


31.73 


89838_16 


CAD 


20.09 


IPACIAGER 


10 


175 


8 


33.65 


89854_1 1 


CAD 


33.26 
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Table 1d - CPP 17 . 


Peptide Sequence ' CEX J Salt 


RP1 1 %B 


Run 


Proteome 


Olav Score 


11 /Ax./ 1 /Aw C Fx 


in 


175 


8 ' 33.65 


M854_15 


CAD 


25.36 




in 


175 


8 33.65 


89854_09 


CAD 


30.91 


1 r Mw IMO Cr\ 


in 


175 


8 33.65 


89854_13 


CAD 


27.05 


vrsTr* ivo rs p 

To 1 OITVilOrx 


in 


175 


8 


[ 33.65 


89854_10 


CAD j 29.63 


TVs 1 OITUwrt 


in 


175 


8 


33.65 


89854 09 


CAD 


\ 30.73 


To 1 wiTUorx 


in 

lU 


175 


9 


35.7 ; 89870 10 

* 


CAD i 47.83 




in 


175 


14 


40.2 


89866.08 


CAD 


27.9 


1 rML# IMO cr\ 


1 1 


175 


7 


31.73 


88249_16 


CAD 


33.62 


To 1 L/iTworx 


11 


175 7 


31.73 


88249.17 


CAD 


26.13 


vr^TP Ivors R 

T o 1 wi T wor\ 


11 


175 1 7 


31.73 


88249.16 


CAD 


39.49 


ii /Awl/Aw cr\ 


11 


175 


8 


33.65 


88265.09 


CAD 


39.79 


ipAr^iAf^cp 


11 


175 


8 


33.65 


88265.10 


CAD 


17.77 




11 


175 


8 


I 33.65 


88265.1 3 


CAD 


21.59 


IPACIAGPR 

1 f rW^ IrWS 


11 


175 


8 


33.65 


88266.1 1 


CAD 


16.46 


' IPAHIAr^PR 


11 


175 


9 


35.7 


88277.10 


CAD 


29.52 


1 o 1 wi 1 \rfor\ 


11 


176 


9 


35.7 


88277.09 


CAD 


53.19 


IPAr^lAAPR 
1 r^Mw IMO cr\ 


11 


175 


10 


37.5 

1 ! 


88205 07 

■ 


CAD 


25.87 












^ 11 1 


175 


13 


43.27 


88253.13 J CAD 


29,07 


IPACIAGER 


12 


175 


8 


33.65 


101029.07 ^ CAD 


16.26 


IPACiAGER 


12 


176 


8 , 


33.65 


101029.09 


CAD 


13.61 


IPACIAGER 


12 


175 


8 


33.65 


101029.06 


CAD 


10.92 


YGTCIYQGR 


12 


175 


8 


33.65 


101029.09 


CAD 


36.99 


YGTCIYQGR 


12 


175 


8 j 33.65 


101029.07 


CAD 


31.12 


YGTCIYQGR 


12 


175 


8 


33.65 


101029.07 


CAD 


29.94 


LW/M^CC 


12 


175 


8 


33.65 


101029.07 


CAD 


6^5 


LWAFCC 


12 


175 


8 


33.65 


101029.07 


CAD 


13.49 

A 


IPACIAGER ! 


12 


175 


9 


35.7 


101053.09 


CAD 


20^4 ) 


IPACIAGER \ 12 


175 


9 


35.7 


101053.08 


CAD 


14.04 j 


IPACIAGER j 12 


175 


10 


37.5 


100965.08 


CAD 


26.08 


YGTCIYQGR | 12 


175 


10 


37.5 


100965.08 


CAD 


27.37 


ADEVAAAPEQIAAOIP i 
EWVSLAWDESLAPK 


12 


175' 


12 


41.34 


101001.09 


CAD 


69.47 


ADEVAAAPEQIAAOIP 
EVWSLAWDESLAPK 


12 


175 ' 14 


45.2 ] 


101037.08 


CAD 


58.63 



37 



wo 2004/090551 PCT/EP2004/003737 



i Table Id -CPP 17 



I Peptide Sequence 


CEX 


San 


RP1 


%B 


Run 


Proteome , Olav Score 


IPACIAGER 


13 


175 


8 


33.65 


89762^09 


CAD 


25 42 


iPACIAGER 


13 


175 


8 


33.66 


89762^14 


CAD 


28 91 


YGTCIYQGR 


13 


175 


8 


33.65 


89762_14 


CAD 


37 


YGTCIYQGR 


13 


176 


8 


^ 33.66 


89762_13 


CAD 


37 54 


LWAFCC 


13 


175 


8 , 33.66 


89762„09 


CAD 


14 67 


IPACIAGER 


13 


175 


9 35.7 


89774^14 


CAD 


IS 


IPACIAGER 


13 


175 


Q 


7 

09. f 


89774J3 


CAD 


*tV.O f 


YGTCIYQGR 


13 


175 


Q 




89774_09 


CAD 




YGTCfYQGR 


13 


175 


Q 




89774_10 


CAD 


15.03 


LWAFCC 


13 


176 


Q 


'k^ 7 


89774_09 




1 0.90 


LW/^CC 


13 


! 176 


Q 


7 


89774_10 


CAD 


17.74 


YGTCIYQGR 


13 


175 




Of .O 


89706_10 


API 


*>0 7A 


IPACIAGER 


13 


175 


15 


47.1 


89786_09 






IPACIAGER 


14 


176 


9 


36.7 


92232_10 


r*An 

wML^ 


AO 07 < 


IPACIAGER 


14 


175 


9 


36.7 


92232_1 1 




1 
1 


LWAFCC 


14 


175 


9 


35.7 


92232_1D 




OA Oft 


IPACIAGER I 14 


175 


10 


37.6 


92204^07 


OAn 


07 SkA 


i EWVSLAWDESLAPK 1 


14 


175 


T2 


41.34 


92582_10 




O 1 .uo 


ADEVAAAPEQIAADIP 

, EVWSLAWDESLAPK 


■ 

14 


175 


12 


41.34 


92582^09 




7n A7 


IPACIAGER 


14 


176 


13 


43.27 


92690_06 


HAn 


OK 7^ 


IPACIAGER 


15 


225 


9 


36.7 


9661 7_1 2 




17 ftO 


IPACIAGER 


15 


225 


9 


35.7 


95617_16 




0<t AQ 


IPACIAGER 


15 


226 


9 


36.7 


95617_14 


CAD 




IPACIAGER 


15 


226 


9 


36.7 


9561 7_1 8 


CAD 


14.9 


IPACIAGER 


15 


225 


9 


35.7 1 


9561 7_1 7 


CAD 


21.35 


IPACIAGER 


15 


225 


9 j 35.7 


9561 7_08 


CAD 


21.9 


YGTCIYQGR 


15 


225 


9 


36.7 


95617_16 


CAD 


23.54 


YGTCfYQGR 


15 


225 


9 


35.7 


95617^1 5 


CAD 


44.53 


YGTCIYQGR 


15 ' 


225 


9 


35.7 


95617^10 


CAD 


38.13 


LWAFCC 


-in 


226 


9 


35.7 


9561 7_09 


CAD 


15.3 ! 


LWAFCC 


15 i 225 


9 


35.7 


95617^09 


CAD 


18.93 


YGTCIYQGR 


18 j 1000 


8 


33.65 


101101_08 


CAD 


51.06 
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For CPP 1 7, the ratio of protein levels in CAD versus control plasma ^nq)les is calculated by 
two methods. The first method calculates the CAD/Control ratio by the number of fractions from each 
^onple containing CPP 17. The result is 2.4 (see Table Id). Alternatively, and more accurately, the 
Olav scores obtained for each peptide in the mass spectrometry data analysis software are used to give 
a weighted ratio. This result is 2.1, indicating that the CPPs of the invention are present at 2.1 times 
the level in CAD plasna compared to control plasma. As such, the CPPs provide a useful diagnostic 
tool, wherein an increased level of a CPP indicates an increased risk of develc^ing, or the presence of, 
a cardiovascular disorder. 

Nuxhber of Control Fractions: 1 16 
Total number of CAD Fractions: 279 
Total CAD/ Control: 2.4 



Olav Score Control: 3720^5 
Olav Score Total CAD: 7946.59 
Total CAD/ Control: 2.1 



It is interesting to note that Table Id displays the tryptic pqitide of S£Q ID NO:18 



NO: 16, and which as such is not expected to be found in plasma. Besides, this tryptic peptide was 
identified in diseased plasma only. This obs^ration may reflect an altered processing of &e 
prqTrotein in the case of the disease. 



Table le- CPP 20 



CPP# 

CPP 20 



Peptide Sequence 


CEX 


Salt 1 RP1 


%B 


Run Number 


AFQYHSK 


5 


75 ' 9 

! 


35.7 


101089_11 


CEEDKEFTCR 


10 


1 

175 


9 


35.7 


89870^1 1 


CEEDKEFTCR 


89870_09 


EPLDDYVNTQGPSLFSVTK 


10 


37.5 


89802.08 


CEEDKEFTCR 


89802_08 


CEEDKEFTCR 


89802_08 


EPLDDYVNTQGPSLFSVTK 


11 


175 


10 


37.5 i 


88206_08 


,EPLDDYV^f^QGPSLFSVTK 


88205 08 


CEEDKEFTCR 


88205_08 
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EPLDDYVNTQGPSLFSVTK 



11 



39.42 88221 08 



CPP nucleic acids 

One aspect of the invention pertains to purified or isolated nucleic acid molecules that encode 
CPPs or biologically active portions thereof as furfher described herein, as well as nucleic acid 
fragments thereof. Said nucleic acids may be used for example in titerapeutic (DNA vaccine) and 
diagnostic methods and in drug scre^iing assays as further described herein. 

An object of the invention is a purified, isolated, or recombinant nucleic acid coding for a CPP, 
compl^entaiy sequ^ces thereto, and fragments thereof. Hie invention also pertains to a purified or 
isolated nucleic acid comprising a polynucleotide having a* least 95% nucleotide identity with a 
polynucleotide coding for a CPP, advantageously 99 % nucleotide identity, preferably 99.5% 
nucleotide identity and most preferably 99.8% nucleotide identity with a polynucleotide coding for a 
CPP, or a sequence coii:q)lementaiy thereto oar a biologically active fragment thereof. Anotiter object 
of the invention relates to purified, isolated or recombinant nucleic acids comprising a polynucleotide 
that hybridizes, und^ the stringent hybridization conditions defined herein, with a polynucleotide 
coding for a CPP, or a sequence conq^lem^tary thereto or a variant thereof or a biologically active 
ftaement tbereof . 

that encode a portion or variant of a CPP, wh^in the portion or variant displays a CPP biological 
activity. Preferably said portion or variant is a portion or variant of a naturally occurring CPP or 
precursor thaeof. 

Another object of the invention is a purified, isolated, or recombinant nucleic acid encoding a 
CPP couipi lstng, consisting essentially of, or consisting of the amino acid sequence selected firom one 
of the groups of SEQ ID Nos: 1-5, 6-10, 1 1-14, 15-23 and 24-28, or fragments thereof wherein the 
isolated nucleic acid molecule encodes one or more motifs such as, for CPP 2: a lipase binding region, 
a hydrophobic face, or a disulfide bond; for CPP 9: a substrate KNA-binding site, a substrate virion 
binding site (preferably a RSV virion), a ribonuclease active site, or a disulfide bond; for CPP 21: a 
cholesterol-binding domain, a glycosylation site, or a disulfide bond; for CPP 17: a bacterial 
endotoxin binding site or a disulfide bond; for CPP 20: a kringle-bmding domain or a disulfide bond. 

The nucleotide sequence determined from the cloning of the CPP-mcoding gene allows for 
the generation of probes and primers designed for use in identifying and/or cloning other CPPs (e.g. 
sharing the novel fimctional domains), as well as CPP homologues from other species. 

A nucleic acid fragment encoding a "biologically active portion of a CPP" can be prepared by 
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isolating a portion of a nucleotide sequence coding for a CPP, which oicodes a polypqstide having a 
CPP biological activity, expre^ing the encoded portion of the CPP (e.g., by reconibinant esqiression 
in vitro or in vivo) and assessing the activity of the encoded portion of &e CPP. 

Hie invention furdier enconq}as5es nucleic acid molecules that diff^ from the CPP nucleotide 
sequences of the inv^tion due to degeneracy of the genetic code and oiccde tiie same CPPs of &e 
invention. 

hi addition to the CPP nucleotide sequences described above, it will be appreciated by those 
skilled in the art that DNA sequence polymoiphisms that lead to changes in Hxc amino acid sequences 
of the CPPs may exist within a population (e.g., the human population). Such genetic polymcffphism 
nmy exist among individuals within a population due to natural allelic variation. Such natural alleUc 
variations can typically result in 1-5% variance in the nucleotide sequ^ce of a CPP-encoding gene or 
nucleic acid sequoice. 

Nucleic acid molecules corresponding to natural allelic variants and homologues of the CPP 
nucleic acids of the invention can be isolated based on their homology to tiie CPP nucleic acids 
disclosed herein using the cDNAs disclosed herein, or a pc^on thereof as a hybridization probe 
according to staoidard hybridization techniqi^ und^ stringent hybridization conditions. 

It will be appreciated tl^ die invention comprises polyp^tides having an amino acid 



Uses of CPP nucleic acids 

Polynucleotide sequences (or the conogjlements thereof) encoding CPPs have various 
applications, including uses as hybridization probes, in chromosome and gene mapping, and in the 
genraation of antisense RNA and DNA. hi addition, CPP-encoding nucleic acids are useful as targets 
for pharmaceutical intervention, e.g. for the development of DNA vaccines, and for the preparation of 
CPPs by recombinant techniques, as descnbed herein. The polynucleotides described herein, 
including sequence variants thereof, can be used in diagnostic as^ys. Accordingly, diagnostic 
methods based on detecting the presence of such polynucleotides in body fluids or tissue saixq)le5 are 
a feature of the present invention. £xan:q>les of nucleic acid based diagnostic assays in accordance 
with tiie present invention include, but are not limited to, hybridization assays, e.g., in situ 
hybridization, and PCR-based assays. Polynucleotides, including ext^ded length polynucleotides, 
sequence variants and fragments thereof^ as described herein, rmy be used to generate hybridization 
probes or PCR primers for use in such assays. Such probes and primers will be capable of detecting 
polynucleotide sequences, including genomic sequences that are similar, or conq)lemmtaiy to, the 
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CPP polynucleotides described herein. 

Hie invention includes primer pairs for canying out a PGR to amplify a segment of a 
polynucleotide of the invmtion. Each primer of a pair is an oligonucleotide having a length of 
betwe^ 15 and 30 nucleotides such that i) one prim^ of the pair forms a perfectly matched duplex 
with one strand of a polynucleotide of the inventicm and die other primer of the pair form a perfectly 
match duplex with the complementary strand of the same polynucleotide, and ii) the primm of a pah- 
form such perfectly matched diq>lexes at sites on the polynucleotide that separated by a distance of 
between 10 and 2500 nucleotides. Prefi^ably, the annealing temperature of each primer of a pair to 
its respective con:g>lementary sequence is substantially the same. 

Hybridization probes derived from polynucleotides of die invention can be used, for example, 
in performing in situ hybridization on tissue san^les, such as fixed or frozen tissue sections pr^>ared 
on microscopic shdes or suspended cells. Briefly, a labeled DNA or ENA probe is allowed to bind its 
DNA or RNA target sample in the tissue section on a prepared mi(7oscopic, under controlled 
conditions. Generally, dsDNA probes consisting of the DNA of interest cloned into a plasmid or 
bactCTiophage DNA vector are used for this purpose, aldiougji ssDNA or ssRNA probes may also be 
used. Probes are generally oligonucleotides betweoi about 15 and 40 nucleotides in length. 
Alternatively, the probes can be polynucleotide probes generated by PGR random priming primer 

Lyyii,£Uiy i^i^^&iai iiutiux vU ucLob iJi t u a Mi iUG-pfiJBCa itidU XJC idUDlC U Oy SUy OX A liUiUUCl VI iHOGi 

groups and the particular detection metibod will correspond to the type of label utilized on the probe 
(e.g., autoradiography. X-ray detection, fluorescent or visual microscopic analysis, as appr<^riate). 
Hie reaction can be further amplified in situ using immunocytochemical techniques directed against 
the label of the detector molecule used, such as an antibody directed to a fluorescein moie^ present 
on a fluorescently labeled jnobe. Specific labelmg and in situ detection methods can be found, for 
exanqile, in Howard, G, C, Ed., Methods in Nonradioactive Detection, Appleton & Lange, Norwalk, 
Conn., (1993), herein incorporated by reference. 

Hybridization probes and PCR p r imer s may also be selected from the genomic sequences 
corresponding to the fuU-iengdi proteins identified in accordance with &e present inv^tion, including 
promoter, eolKtncer elements and introns of the gene encoding the naturally occurring polypeptide. 
Nucleotide sequaices encoding a CPP can also be used to construct hybridization probes fen* m^iping 
the gene encoding that CPP and for the genetic analysis of individuals, individuals carrying variations 
of, or mutations in the gene encoding a CPP of the present invention may be detected at the DNA 
level by a variety of techniques. Nucleic acids used for diagnosis may be obtained from a patient's 
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cells, including, for example, tissue biopsy and autopsy material. Genomic DNA may be used directly 
for detection or may be amplified enzymatically by using PGR (Saiki^ et aL Nature 324: 163-166 
(1986)) prior to analysis. RNA or cDNA may also be used for the same puixK)se. As an example, PGR 
prim e r s con^lementaiy to the nucleic acid of the present invention can be used to id^ti^ and 
analyze mutations in die gene of the present invention. Deletions and insertions can be detected by a 
change in size of the amplified product in comparison to the normal genotype. Point mutations can be 
idraitified by hybridizing amplified DNA to radiolabeled RNA of the inv^tion or alternatively, 
radiolabeled antisense DNA sequences of the invention. Sequence changes at specific locations may 
also be revealed by nuclease protection assays, such as RNase and SI protection or the chemical 
cleavage method (e.g. Cotton, et al., Proc, Natl Acad, Sci. USA 85:4397-4401 (1985)), or by 
diflferences in melting temperatures. '^Molecular beacons" (KostriJds L. G. et al., Science 279:1228- 
1229 (1998)), hairpin-diaped, single-stranded synthetic oligonucleotides containing probe sequences 
which are con^lementary to the nucleic acid of the preset invention, may also be used to detect point 
mutations or other sequence changes as well as monitor expression levels of CPPs. 

Oligonucleotide and Antisense Compounds 

Oligonucleotides of the invention, including PGR primers and antisense conq^ounds, are 

Appaea i^iosyscems ti*ostCTi^ity, CA/modet i»OB, S92 or 35^^ UMA/KAA synmesizer, or lilce 
instrument Preferably, phosphoramidite chemistry is employed, e.g. as disclosed in the following 
references: Beaucage and Iyer, Tetrahedron, 48; 2223-23 1 1 (1992); Molko et al, U.S. patent 
4,980,460; Koster et al, U.S. patent 4,725,677; Caruthers et al, U.S. patents 4,415,732; 4,458,066; and 
4,973,679; and the like. For ther^utic use, nuclease resistant backbones are preferred. Many types 
of modified oligonucleotides are available that confo' nuclease resistance, e.g. phosphorothioate, 
phosphorodithioate, phosphoramidate, or the like, described in many refoCTces, e.g. 
phosphorothioates: Stec et al, U.S. patent 5,151,510; BBrschbein, U.S. patent 5,166,387; Bergot, U.S. 
patent 5,183,885; phosphoramidates: Froehlw et al, International application PCTAJS90/03 138; and 
for a review of additional ^>plicable chemistries: Uhlmann and Peyman (cited above). The length of 
the antisense oligonucleotide has to be sufficiently large to ensure that specific binding will take 
place only at the desired target polynucleotide and not at other fortuitous sites. The tqjper range of 
the length is detmnined by several fiictors, including the inconvenience and expense of synthesizing 
and purifying oligomers greater than aboiit 30-40 nucleotides in length, tibe greater tolerance of longer 
oligonucleotides for mismatches than shorter oligonucleotides, and the like. Preferably, ttie antisense 
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oligonucleotide of ^ invention have lengths in the range of about 15 to 40 nucleotides. More 
piefi^ably, the oligonucleotide moieties have lengths in the range of about 18 to 25 nucleotides. 

Prints and probes 

Pruners and probes of the invention can be prep ared by any suitable xnetiu>d, including, for 
example^ cloning and restriction of appropriate sequences and direct chmical synthesis by a method 
such as the phosphodiester method of Narang SA et al (Methods Enzymol 1979;6S:90-98), the 
phosphodiester method of Brown £L et al (Methods &i2^anol 1979;68:109-151), the 
diethylphosphoramidite method of Beaucage et al (Tetrahedron Lett 198U 22: 1859-1862) and the 
solid siqjport method described in £P 0 707 592, the disclosures of which are incorporated herein by 
reference in their raitireties. 

Detection probes are generally nucleic acid sequences or uncharged nucleic acid analogs such 
as, for Qxample peptide nucleic acids which are disclosed in Intenmtional Patent Application WO 
92^20702, rmirpholino analogs which are described in U.S. Patents Nurhbered 5,185,444; 5,034,506 
and 5,142,047. If desired, the probe may be rendered ''non-extendable" in ^t additional dNTPs 
cannot be added to the probe. In and of themselves analogs usually are non-ext^dable and nucleic 
acid probes can be rendered non-extendable by modifying the 3* end of ftie probe such that the 

pxoue can ue iuncaumtiiz.cu wiui uie caputic^A/i AKHCcut^ aujct to uicfc oy cunsuoic ui' uuicrwisc uiucK 
the hydroxyl group. 

Any of the polynucleotides of the present inv^tion can be labeled, if desired, by incorporating 
any label group known in the art to be detectable by spectroscopic, photochemical, biochemical, 
immunochemical, or chemical means. Additional examples include ncm-radioactive labeling of 
nucleic acid fiiagments as described in Urdea et al. (Nucleic Acids Research. 1 1:4937-4957, 1988) or 
Sanchez-Pescador et aL (J. Clin. Microbiol. 26(10):1934-1938, 1988). In addition, the probes 
according to the present invention miay have structural characteristics such that they allow the signal 
aii^)lification, such structural characteristics being, for example, branched DNA probes as those 
described by Urdea et al (Nucleic Acids Symp. Ser. 24: 1 97-200, 199 1) or in the Eim>pean patrat No. 
EP 0225807 (Chiron). 

A label can also be used to capture fbc primer, so as to &ci1itate the immobilization of either 
tiie primer or a primer extension product, such as amplified DNA, on a solid support. A capture label 
is attached to the primers or probes and can be a specific binding member which forms a binding pair 
with the soUd's phase reagent's specific binding member (e.g. biotin and str^tavidin). Therefore 
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depending upon the type of label carried by a polynucleotide or a probe, it may be en^loyed to 
capture or to detect the target DNA. Further, it will be imderstood that the polynucleotides, primers 
or probes provided herein, may, themselves, serve as the capture label. For exanq)le, in the case 
^ere a solid phase reagent's binding member is a nucleic acid sequence, it rmy be selected such that 
it binds a con^lementary portion of a primer or probe to thereby immobilize the primer or probe to 
the solid phase. In cases where a polynucleotide probe itself serves as Ihe binding member, tiiose 
skilled in the art win recognize that the probe will contain a sequence or 'tail" that is not 
complementaiy to the target in &e case v/hsrc a polynucleotide primer itself serves as die capture 
label, at least a portion of the primer will be free to hybridize with a nucleic acid on a solid phase. 
DNA labeling techniques are well known to the skilled technician. 

The probes of the present invention are useful for a number of purposes. They can be notably 
used in Southern hybridization to genomic DNA. The probes can also be used to detect PGR 
amplification products. They may also be used to detect mismatches in CPP-encoding genes or 
rDRNA using other techniques. 

Any of the nucleic acids, polynucleotides, primers and probes of the present inv^tion can be 
conveniently immobilized on a sohd siq)port Sohd supports are known to those skilled in the art and 
include the walls of wells of a reaction tray, test tubes, polystyrene beads, magnetic beads, 

biood^eUs, duracyi^ and otii^. ihesolid support is not cntical and can be selected by one skilled 
in the art Thus, latex particles, raicroparticles, magnetic or non-magnetic beads, membranes, plastic 
tubes, walls of microtiter wells, glass or silicon chips, sheep (or other suitable animaFs) red blood 
cells and duracytes are all suitable examples. Suitable methods for immobilizing nucleic acids on 
solid phases include ionic, hydrophobic, covalent interactions and the Hke. A soHd support, as used 
herein, refers to any material which is insoluble, or can be made insoluble by a subsequent reaction. 
The solid support can be chosen for its intrinsic ability to attract and immobilize the capture reagent 
Alternatively, the soHd phase can retain an additional receptor which has the ability to attract and 
immobilize the c^ture reagmt The additional receptor can include a charged substance that is 
oppositely charged with respect to the capture reagent itself or to a charged substance conjugated to 
the capture reagent As yet another alternative, the receptor molecule can be any specific binding 
mCTiber attached to the solid siQ)port and which has the abiUty to immobilize the capture reag^t 
tiirough a specific binding reaction. The receptor molecule oiables the indirect binding of the capture 
reagent to a sohd siqiport mat^ial before fee performance of fee assay or during fee performance of 
fee asay, Ihe solid phase feus can be a plastic, d^vatized plastic, magnetic or non-nmgnetic metal. 
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glass or silicon surface of a test tube, microtiteT well, sheet, bead, ndi^oparticle, chip, sheep (or o&er 
suitable animal's) red blood cells, duracytes and other configurations known to those of ordinary skill 
in the art Hie nucleic acids, polynucleotides, primers and probes of the invention can be attached to 
or innnobilized on a solid support individually or in groups of at least 2, 5, 8, 10, 12, 15, 20, or 25 
distinct polynucleotides of the invention to a single solid support, in addition, polynucleotides other 
dian tiiose of the invention may be attached to the same solid support as one or more polynucleotides 
of fhc invention. 

Any polynucleotide provided herein may be attached in overlapping areas or at random 
locations on a solid support. Alternatively the polynucleotides of the invention may be attached in an 
ordered array wherein each polynucleotide is attached to a distinct region of the solid support which 
does not overly with the attachment site of any other polynucleotide. Preferably, such an cmiered 
array of polynucleotides is designed to be "addressable" where the distinct locations are recorded and 
can be accessed as part of an assay procedure. Addressable polynucleotide arrays typicafly conqjrise 
a plurality of different oligonucleotide probes fbat are coupled to a sur&ce of a substrate in different 
known locations. The knowledge of the precise location of each polynucleotides location makes these 
"addressable" arrays particularly useful in hybridization assays. Any addressable array technology 
known in fee art can be errq>loyed with the polynucleotides of the inventiorL One particular 

uc&i/iutttaju u*y rattsa; roi pctoiicanons wu yu/iDU/u and y^iuu!^2, the aisciosures ot 

which are incorporated herem by reference in their entireties. 

Methods for obtaining variant nucleic acids and polypq>tides 

In addition to naturally-occurring allelic variants of the CPP sequences that may exist in the 
population, the skilled artisan will appreciate (hat changes can be introduced by mutation into the 
nucleotide sequence coding for CPPs, thereby leading to changes m the amino acid sequence of the 
encoded CPPs, with or without altering the functional ability of the CPPs. 

Several types of variants are conten^lated including 1) one m which one or more of the 
amino acid residues are substituted with a conserved or non-conserved amino acid residue and such 
substituted amino acid residue may or may not be one aicoded by the genetic code, or 2) one in which 
one or more of the amino acid residues includes a substituent group, or 3) one in which the mutated 
CPP is fused with ano&er conq>ound, such as a conqjound to mcrease the half-life of the polypeptide 
(for exanq>le, polyetlqdene glycol), or 4) one in which the additional amino acids are fused to the 
CPP, such as a leader, a signal or anchor sequence, a sequence which is employed for purification of 
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the CPPy Oft sequence from a jHecursor protein. Such variants are deemed to be within the scope of 
those skilled in the art. 

For exan^le, nucleotide siibstitutions leading to amino acid substitutions can be made in &e 
sequences tliat do not substantially change the biological activity of the protein. An amino acid 
residue-can be altered from I3ie wild-type sequence encoding a CPP, or a biologically active fragment 
or homologue th^eof without altering flie biological activity. In general, amino acid residues that are 
shared among the CPPs of the present invention are predicted to be less amenable to alteratioiL 

In another aspect, the invention pertains to nucleic acid n»>lecules encoding CPPs that contain 
changes in amino acid residues ttiat result in increased biological activity, or a modified biological 
activity. Id anoth^ aspect, the invention pertains to nucleic acid molecules encoding CPPs that 
contain changes in amino acid residues that are essratial for a CPP biological activity. Such CPPs 
difiBer in amino acid sequence from SEQ ID NOs: 1-5, 6-10, 1 1-14, 15-23 and 24-28 and display 
reduced activity, or essentially lack one or more CPP biological activities. 

Mutations, substitutions, additions, or deletions can be introduced into any of SEQ ID NOs: 1- 
5, 6-10, 1 1-14, 15-23 and 24-28, by standard techniques, such as site-directed mutagenesis and PCR- 
mediated mutagenesis. For example, conservative amino acid substitutions may be made at one or 
more predicted non-essential amino acid residues. A "conservative amino acid substitution" is one in 

raimuch^ ux rtf i iifHt ir » ;iu i=dMu«ics mmug ^miaar siac fjuama aavc ucca acniica m me an. mese 
femilies include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side 
chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, a^aragine, 
gjutamine, serine, threonine, tyrosine, c^teine), nonpolar side chains (e.g., alanine, valine, leucine, 
isoleucine, proline, phraiylalanine, methionine, tryptophan), beta-branched side chains (e.g., 
tisreonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, 
histidine). Thus, a predicted nonessential amino acid residue in a CPP, or a biologically active 
fragment or homologue thereof may be replaced with another amino acid residue from the same side 
chain family. Alternatively, in another embodiment, mutations can be introduced randomly along all 
or part of a CPP coding sequence, such as by saturation mutagenesis, and the resultant mutants can be 
screened for CPP biological activity to identify mutants that retain activity. Following mutagenesis of 
the nucleotide encoding one of SEQ ID NOs: 1-5, 6-10, 1 1-14, 15-23 and 24-28, the encoded protein 
can be e^qoressed recombinantty and the activity of the protem can be determined in any suitable 
assay, for example, as provided herein. 

The invention also provides CPP chimeric or fusion proteins. As used herein, a CPP "chimeric 
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protein" or "fusion protein" conq)rises a CPP of tiie invaition or fragment thereof; operatively linted 
or fused in fiame to a non-CPP polypeptide sequence, in a preferred embodiment, a CPP fusion 
protein conqjrises at least one biologically active portion of a CPP. In another preferred embodiment, 
a CPP fusion protein comprises at least two biologically active portions of a CPP. For exanq)le, in 
one embodiment, the fusion protein is a GST-CPP fusion protein in which CPP domain i^quences are 
fused to the C-terminus of the GST sequences. Such fusion proteins can ^cilitate the purification of 
recombinant CPPs. hi anodier embodiment, the fusion protein is a CPP containing a heterologous 
signal sequmce at its N-tenninus, for ©cample, to allow for a desired cellular localization in a certain 
host cell, hi yet anotiier embodiment, the fusion is a CPP biologically active fragment and an 
immunoglobulin molecule. Such fusion proteins are useful, for example, to increase the valency of 
CPP binding sites. For example, a bivalent CPP binding site may be formed by fusing biologically 
active CPP fragments to an IgG Fc protein. 

CPP fusion proteins of the invention can be used as immunogens to produce anti-CPP 
antibodies in a subject, to purify CPP or CPP ligands, and in screening assays to identify CPP 
modulators. 

Furthermore, isolated fragments of CPPs can also be obtained by screening pqjtides 
recombinantly produced from the corresponding firagment of the nucleic acid encoding such peptides. 

v</*irw-uti.w*^i i»xwA*Axafc^i4i owxiu^Ai*ii3ca.-*Kiwv^^i b-^M^ ii£astia:>a y , iTorexan^ic, a i^srir oi me presenc 
invention may be arbitrarily divided into fragments of desired length with no overlap of the fragments, 
or preferably divided into overlapping fragments of a desired length- The fragments can be produced 
(recombinantly or by chemical synthesis) and tested to identify those peptidyl fragments with a CPP 
biological activity, for example, by microinjection assays or in vitro protein binding assays, hi an 
illustrative embodiment, peptidyl portions of a CPP, such as a CTP target bmding region, can be 
tested for CPP activity by expression as thioredoxin fusion protems, each of which contains a discrete 
fragment of tiie CTP (see, for exanq>le, U.S. Patents 5, 270,181 and 5,292,646; and PCX pubUcation 
WO94/02502, the disclosures of which are incorporated h^in by reference). 

In addition, hbraries of fragments of a CPP coding sequence can be used to generate a 
variegated population of CPP fragments for screening and subsequent selection of variants of a CPP. 
In one ranbodim^t, a library of coding sequence fragments can be genmted by treating a double 
stranded PCR fragment of CPP coding sequence with a nuclease under conditions wherein nicking 
occurs only about once per molecule, denaturing the double stranded DNA, renaturing the DNA to 
form double stranded DNA which can include sense/antisense pairs from different nicked products. 
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removing single stranded portions from reformed dl^>lexes by treatmeat with SI nuclease, and 
ligating the resulting fragment library into an expre^on vector. By this method, an expressicm library 
can be derived which raicodes N-tenninal, C-terminal and irAemsd fragments of various sizes of the 
CPP. 

Whether a change in ^ anmio acid sequence of a peptide results in a functional CPP 
hon»>log can be readily determined by assessing at least one CPP biological activity of the variant 
peptide. Peptides in which more flian one replacement has taken place can readily be tested in the 
same manner. 



Chendcal Mamrfacture of CPP Compositions 

Peptides of the inv^tion are synthesized by standard techniques (e.g. Stewart and Young, 
Solid Phase Peptide Synthesis, 2nd Ed., Pierce Chemical Conq>any, Rockford, DL, 1984). Preferably, 
a commercial peptide synttiesiiser is used, e.g. Apphed Biosystems, Inc. (Foster City, CA) model 
430A, and polyp^tides of the invention may be assembled from multiple, separately synthesized and 
purified, peptide in a convergent synthesis approach, e.g. Kent et al, U.S. patent 6,184,344 and 
Dawson and Kent, Amm. Rev. Biochem., 69: 923-960 (2000). Peptides of the invention may be 
assembled by solid phase synthesis on a cross-linked polystyrene support starting from the carboxyl 
terminal residue and addine amino acids in a steowise ^shion until the entrre nentide has been 



et al. Int. J. Peptide Protein Res., 40: 180-193 (1992); Merrifield, J. Amer. Chem. Soc., Vol. 85, pg. 
2149 (1963); Kent et ai., pg 185, m Peptides 1984, Ragnarsson, Ed. (Ahnquist and Weksell, 
StocUiolm, 1984); Kmt et al., pg. 217 in Pq>tide Chemistry 84, Izumiya, Ed. (Protein Research 
Foundation, BJI. Osaka, 1985); Merrifield, Science, Vol. 232, pgs. 341-347 (1986); Kent, Ann. Rev. 
Biochrai, Vol. 57, pgs. 957-989 (1988), and references cited in these latter two referaices. 

Preferably, chemical synthesis of polypeptides of the invention is carried out by the assembly 
of peptide fragments by native chemical ligation, as described by Dawson et al. Science, 266: 776-779 
(1994) and Kent el al, U.S. patent 6,184,344. Briefly, in the approach a first peptide fragment is 
provided wifli an N-terminal cysteine having an imoxidized sulfliydryl side chain, and a second 
peptide fiagmmt is provided with a C4enninal thioester. The unoxidized sulfhydryl side clmin of the 
N-tenninal cysteine is then condensed with the C-temunal thioester to produce an intermediate 
pq>tide firagment which links the first and second peptide fragments with a P-aminothioest^ bond. 
The P-aminothioester bond of the intermediate peptide fragment then undergoes an intramolecular 
rearrangement to produce the peptide fragment product which links the first and second peptide 
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fragments with an anride bond. I^eferably^^eN-trnxiinal cysteine of the internal fragments is 
protected from undestred cycliiration and/or concatraiaticm reactions by a cyclic thiazolidine 
protecting group as de^aibed below. Preferably, such cyclic thiazolidine protecting group is a 
thioprolinyl group. 

Peptide fragments having a C-terminal thioester may be produced as described in the 
following refCTences, which are incorporated by reference: Kent et al, U.S. patent 6, 1 84^44; Tam et 
al, Proc. NatL Acad. Sci., 92: 12485-12489 (1995); Blake, lot J. Peptide Protein Res., 17: 273 (1981); 
Canne et al. Tetrahedron Letters, 36: 1217-1220 (1995); Hackeng et al, Proc. Natl. Acad, Sci„ 94: 
7845-7850 (1997); or Hackeng et al, Proc. Natl. Acad. Sci., 96: 10068-10073 (1999). Preferably, the 
method des(^bed by Hackeng et al (1999) is employed. Briefly, peptide fragments are synthesized on 
a solid phase siq>port (described below) typically on a 0.25 mmol scale by using the in situ 
neutralization/HBTU activation procedure for Boc chemistry disclosed by Scfanolzer et al, Iht J. 
Peptide Protein Res., 40: 180-193 (1992), which reference is incorporated herein by inference. 
(HBTU is 2-(lH-benzotriazol-l-yl)-l,l,3,3-te1ramelhyli]romum hexafluorophosphate and Boc is tert- 
butoxycarbonyl). Each synthetic cycle consists of N"-Boc rranoval by a 1- to 2- minute treatment 
with neat TFA, a 1-minule DMF flow wash, a 10- to 20-minute coiq)ling time with 1 .0 mmol of 
preactivated Boc-amino acid in the presence of DIEA, and a second DMF flow wash. (TFA is 

Bec-amino acids (hi tnrm)!) are preacfivated for 3 nrinutes with 1 .0 mmol of HBTU (0.5 M in DMF) 
in the presence of excess DIEA (3 nmiol). After each coiq)ling step, yields are determined by 
measuring residual free amine with a conventional quantitative ninhydrin assay, e.g. as disclosed in 
Sarin et al. Anal. BiochCTL, 1 17: 147-157 (1981). After coupling of Ghi residues, a DCM flow wash 
is used before and after deprotection by using TFA, to prevent possible high-tenq>mture 
(TFA/DMF)-catalyzed pyrrolidone formation. After chain assembly is conapleted, the peptide 
fragments are deprotected and cleaved from tiie resin by treatment with anhydrous HF for 1 hour at 
0**C with 4% p-cresol as a scavenger. The imidazole side-chain 2,4-dinitrophenyl (dnp) protecting 
groups remain on the His residues because the diip^emoval procedure is incompatible with C- 
tCTninal thioester groups. However, dnp is graduaUy removed by thiols during the ligation reaction. 
After cleavage, peptide fragments are precipitated with ice^ld dielhylether, dissolved in aqueous 
acetonitrile, and lyophilized. 

Thioester peptide fragments described above are preferably synthesized on a trityl-associated 
mercaptopropionic acid-leucine (TAMPAL) resin, made as disclosed by Hackeng et al (1999), or 
comparable protocol. Briefly, N*-Boc-Leu (4 mmol) is activated with 3.6 mmol of HBTU in the 
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presence of 6 inmol of DIE A and coupled for 16 minutes to 2 mmol of p-methylbenzbydrylaznine 
(MBHA) resin, or the equivalent Next, 3 mmol of S4rityl merc^topropionic acid is activated with 
2.7 mmol of HBTU in the presence of 6 mmol of DIEA and coupled for 16 minutes to Leu-MBHA 
resin. Tbs resulting TAMPAL resin can be used as a starting resin for polypeptide-chain assembly 
5 after removal of the trityl protecting group with two 1 -minute treatments with 3.5% triisopropylsilane 
and 2 J% H2O in TFA. The thioest^ bond can be fonned with any desired amino acid by using 
standard in situ-neutralization peptide coupling protocols for 1 hour, as disclosed in Schnolzer et al 
(cited above). TreatmOTt of the final peptide fragment with anhydrous HF yields liie G-terminal 
activated morcaptopropionic acid-leucine (MPAL) thioester peptide fragments. 

10 Preferably, thiazolidine-protected thioest^ pq)tide fragment intermediates are used in native 

chemical ligation under conditions as described by Hackeng et al (1999), or like conditions. Briefly, 
0. 1 M phosphate buff^ (pH 8.5) containing 6 M guanidine, 4% (vol/vol) benzybnercaptan, and 4% 
(vol/vol) thiophenol is added to dry pq)tides to be ligated, to give a flnal peptide concentration of 1-3 
mM at about pH 7, lowered because of the addition of thiols and TFA frt)m the lyophilized peptide. 

15 Preferably, &e ligation reaction is performed in a heating block at 37°C and is periodically vortexed 
to equilibrate the ^ol additives. The reaction may be monitored for degree of completion by 
MALDI-MS or HPLC and electrospray ioni^ticm MS. 

Ting of Has product is opened by treatment with a cysteine deprotecting agent, such as O- 
20 methylhydroxylamine (0.5 M) at pH 3.5-4.5 for 2 hours at 37^ C, after which a 1 0-fold excess of Tris- 
(2-carboxyetfayl)-phosphine is added to the reaction mixture to conipletely reduce any oxidizing 
reaction constituents prior to purification of the product by conventional preparative HPLC. 
Preferably, fractions containing the ligation product are idoatifred by electrospray MS, are pooled, 
and lyophilized. 

25 After the synthesis is completed and the final product purified, the final polypeptide product 

may be refolded by conventional techniques, e.g. Creighton, Meth. Enzymol., 107: 305-329 (1984); 
White, Meth. EozyraoU 11: 481-484 (1967); Wetlaufer, Meth. Enzymol., 107: 301-304 (1984); and 
the like. Prefeably, a final product is refolded by air oxidation by the following^ or like: The 
reduced lyophilized product is dissolved (at about 0. 1 mg/mL) in 1 M guanidine hydrochloride (or 

30 like chaotropic agent) with 100 noM Tris, 10 mM methionine, at pH 8.6. After gentle overnight 
stirring, die re-folded product is isolated by reverse phase HPLC with conventional protocols. 

Recombinant Expression Vectors and Host Cells 
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The polynucleotide sequ^ces described herein can be used in recombinant DNA molecules 
that direct the expression of the corresponding polypeptides in appropriate host cells. Because of the 
degeneracy in the genetic code, o&er DNA sequences may ^code the equivalent amino acid 
sequence, and may be used to clone and express the CPPs. Codons preferred by a particular host cell 
may be selected and substituted into the naturally occurring nucleotide sequences, to increase the rate 
and/or efficiency of expression. The nucleic acid (e.g., cDNA or genonuc DNA) encoding the desired 
CPP may be inserted into a rq)licable vector for cloning (amplification of the DNA), or for 
expre^on. The polypeptide can be expressed recombinantly in any of a number of expression 
systems according to methods known in the art (Ausubel, et ai., editors. Current Protocols in 
Molecular Biology, John Wiley & Sons, New York, 1990). Appropriate host cells include yeast, 
bacteria, archebacteria, fungi, and ins^ and animal cells, including mamnmlian cells, for exan^le 
primary cells, including stem cells, including, but not limited to bone marrow stem cells. More 
specifically, these include, but are not hmited to, microorganisms such as bacteria transformed witii 
recombinant bacteriophage, plasmid or cosmid DNA expression vectors, and yeast transformed with 
yeast expression vectors. Also included, are insect cells infected with a recombinant insect virus 
(such as baculovirus), and mammahan expression systems. The nucleic acid sequence to be expressed 
may be inserted into the vector by a variety of procedures. Jn general, DNA is inserted into an 

generally include, but are not limited to, one or more of a signal sequence, an origin of replication, 
one or more marker genes, an rashancer element, a promoter, and a transcription termination sequence. 
Construction of suitable vectors containing one or more of diese components enq>loys standard 
ligation techniques which are known to the skilled artisan. 

The CPPs of the present invention are produced by culturing a host ceU transformed with an 
expression vector containing a nucleic acid encoding a CPP, under the appropriate conditions to 
induce or cause eiq^ression of the protein. The conditions ^)propriate for CPP repression will vaty 
with the choice of the e^qiression vector and the host cell, as ascertained by one skilled in the art. For 
exanq)le, the use of constitutive promoters in the expression vector may require routine optimization 
of host cell growth and proliferation, while die use of an inducible promoter requires die appropriate 
growth conditicms for induction. In addition, in some embodiments, the timing of the harvest is 
important. For example, die baculoviral systen^ used in insect cell expression axe lytic viruses, and 
dius harvest time selection can be crucial for product yield 

A host cell strain may be chosm for its ability to modulate the expression of the inserted 
sequences or to process the expressed protein in the desired fashion. Such modifications of the protein 
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include, but are not linnted to, glycosyl, acetyl, phosphate, amide, lipid, carboxyl, acyl, or 
caibohydrate groi^. Fost-translational processing, which cleaves a "prepro'^ form of the protein, may 
also be inqsoitant for correct insertion, folding and/or function. By way of example, host cells such as 
CHO, HeLa, BHK, MDCK, 293, W138, etc. have specific cellular machinery and characteristic 
mechanisms for such post-translational activities and may be chosen to ensure the correct 
modification and processing of die introduced, foreign protein. Of particular intmst are Drosophila 
melanogaster cells, Saccharomyces cerevisiae and other yeasts, E, coli. Bacillus subtilis, SF9 cells, 
C129 cells, 293 cells, Neurospora, BHK, CHO, COS, and HeLa cells, fibroblasts, Schwanoma cell 
lines, immortalized mammalian myeloid and lymphoid cell lines, Jurkat cells, human cells and other 
primazy cells. 
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relationship with another nucleic acid sequence. For exanq^le, DNA for a pr^equence or secretory 
leader is opeiably linked to DNA for a polypeptide if it is e^qsressed as a preprotein that participates in 
the secretion of the polypeptide; a promote* or enhancer is operably linked to a coding sequence if it 
affects flie transcription of the sequence; or a ribosome binding site is operably linked to a coding 
sequence if it is positioned so as to facilitate translation. Generally, "operably linked" DNA sequences 
are contiguous, and, in the case of a secretory leader or other polypeptide sequmce, contiguous and in 



at convenient restriction sites. If such sites do not exist, the synthetic oUgonucleotide adaptors or 
Imkers are used in accordance with conventional practice. Promoter sequoices encode eith^ 
constitutive or inducible promoters. The promoters may be either naturally occurring promoters or 
hybrid promoters. Hybrid promoters, which combine elements of more than one promoter, are also 
known in the art, and are useful in the present invention. The expression vector may comprise 
additional elements, for example, &e expression vector may have two replication systems, thus 
allowing it to be mai nt a i ned in two organisms, for example in mammalian or insect cells for 
ejqiression and in a procaiyotic host for cloning and an^lification. Both esqiression and cloning 
vectors contain a nucleic acid sequmce that enables the vector to replicate in one or more selected 
host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. The origin of 
replication ft^om the plasmid pBR322 is suitable for nwst Giam-^iegative bacteria, the 2: plasmid 
origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are 
useful for cloning vectors in mammalian cells. Further, for integrating expression vectors, the 
expression vector contains at least one sequence homologous to the host cell genome, and preferably, 
two homologous sequences which flank the egression construct. The integrating vector may be 
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directed to a specific locus in the host cell by selecting the appropriate hoiiK>logous sequence for 
inclusicm in the vector. Constructs for integrating vectors are well known in the art in an additional 
embodiment^ a heterologous expression control element may be operably linked with the endogenous 
gene in the host cell by homologous recombination (described in US Patents 6410266 and 6361972, 
5 disclosures of which are hereby incorporated by refermce in &eir entireties). This technique allows 
one to regulate expression to a desired level with a chosen control element while ensuring proper 
processing and modification of C^P endogenously expressed by the host cell. UseM heterologous 
expression control elements include but are not limited to CMV immediate early promoter, the HS V 
thymidine kinase promoter, the early and late SV40 promoters, the promoters of retroviral LTRs, such 

10 as those of the Rous Sarcoma Virus (RSV), and metallothionein promoters. 

Prefmbly, &e expression vector contains a selectable marker g^e to allow die selection of 
transformed host cells. Selection genes are well known in the art and will vary with the host cell 
used. Expression and cloning vectors will ^rpically contain a selection gene» also termed a selectable 
marker. Typical selection genes encode proteins that (a) confer resistance to antibiotics or oth^ 

15 toxins, e.g., anipicillin, neomycin, methotrexate, or tetracycline, (b) conq}lement auxotrophic 
deficiencies, or (c) supply critical nutrients not available for fcom complex media, e.g., the g^e 
Qicoding D-alanine racemase for Bacilli. 

conditions suitable for the expression and recovery of the encoded protein fi-om cell culture. The 
20 protein produced by a recombinant cell may be secreted, membrane-bound, or contained 

intracellularly depending on the sequence and/or the vector used. As will be understood by those of 
skill in the art, expression vectors containing polynucleotides encoding the CPP can be designed with 
signal sequences which direct secretion of the CPP through a prokaryotic or eukaryotic cell 
membrane. The desired CPP may be produced recombinantly not only direcfiy, but also as a fiision 
25 polypeptide with a heterologous polypeptide, which may be a signal sequence or other polypeptide 
having a specific cleavage site at the N-terminus of the mature protein or polypeptide. In general, the 
signal sequence may be a component of the vector, or it may be a part of the CPP-«ncoding DNA tiiat 
is inserted into the vector. The signal sequence may be a prokaryotic signal sequence selected, for 
example, from the grotq) of tiie alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin n 
30 leaders. For yeast secretion fhc signal sequence may be, e.g., the yeast invertase leader, alpha factor 
leader (including Saccharomyces and Kluyveronr/ces a-^tor leaders, the latter described in U.S. Pat. 
Ko. 5,010,182), or acid phosphatase l^er, the C. albicms glucoamylase leader (£P 362,179 
published Apr. 4, 1990), or the signal described in WO 901 13646 published Nov. 15, 1990. In 
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mammalian cell expression, mammalian signal sequences may be used to direct secretion of the 
protein, such as signal sequences from secreted polypeptides of the same or related species, as well as 
viral secretoiy ieaders. According to the exfHession system selected, the coding sequence is inserted 
into an appropriate vector, which in turn may require the presence of certain characteristic ''control 
5 elements" or "regulatory sequences.*' Appropriate constructs are known generally in the art (Ausubel, 
et aL, 1990) and, in many cases, are available from commercial suppliers such as invitrogen (San 
Diego, Calif.), Stiatagene (La JoUa. Calif.), Gibco BRL (Rockville, Md.) or Qontecb (Palo Alto, 
Calif.)- 

10 Expression in Bacterial Systetns 

TransformatioD of bacterial cells may be achieved using an inducible promoter such as the 
hybrid lacZ promoter of the "BLUESCRIPr' Phagemid (Stratagene) or "pSPORTl" (Gibco BRL). In 
addition, a number of expression vectors may be selected for use in bacterial cells to produce 
cleavable fusion proteins that can be easily detected and/or purified, including, but not limited to 

15 "BLUESCRIFT" (a-galactosidase; Strats^oie) or pGEX (glutathione S-transferase; Promega, 
Madison, Wis.). A suitable bacterial promote is any nucleic acid sequence enable of binding 
bacterial KNA pol3mierase and initiating the downstream (3 ') transcription of the coding sequence of 

placed proximal to the 5' end of the coding sequence. This transcription initiation region typically 
20 includes an RNA polymerase binding site and a transcription initiation site. Sequmces ^coding 
metabolic pathway enzymes provide particularly useful pronKiter sequmces. Exansples include 
promoter sequences derived from si£gar metabolizing enzymes, such as galactose, lactose and maltose, 
and sequaices derived from biosymhetic en2ymes such as tryptophan. Promoters from bacteriophage 
may also be used and are known in the art In addition, synthetic promote and hybrid promoters are 
25 also use&l; for exan^le, the tat promoter is a hybrid of the trp and lac promote sequences. 

Fur&ermore, a bacterial promoter can include naturally occurring promoters of non-bacterial origin 
that have tiie ability to bind bacterial RNA polymerase and initiate transcription. An efficirat 
ribosome-binding site is also desirable. The expression vector may also include a signal peptide 
sequence that provides for secretion of the CPP in bact^a. The signal sequence typically encodes a 
30 signal peptide comprised of hydrophobic amino acids which direct the secretion of the protein from 
the cell, as is well known in the art. The protein is either secreted into &e growdi media (gram- 
positive bacteria) or into the periplasmic space, located between the inner and out^ membrane of the 
cell (gram-negative bacteria). The bacterial e^qiression vector may also include a selectable marker 
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gene to allow fat the selectioB of bacterial strains that have been transfonned. Suitable selection gmes 
include drug resistance goies such as ampicillin, chlQranq>henicol, erythromycin, kanamycin, 
neomycin and tetracycline. Selectable mark^s also include biosynthetic gmes, such as those in the 
histidine, tryptophan and leucine biosynthetic pathways. When large quantities of CPPs are needed, 
e.g., for die imiuction of antibodies, vectors which direct higji level expression of fusion proteins that 
are readily purified be desirable. Such vectors include, but are not limited to, multifunctional E. 
coli cloning and expression vectors such as BLUESCRIPT (Stratagene), in ^ch the CPP coding 
sequence nmy be ligated into the vector in-fiame with sequences for die amino-terminal Met and the 
subsequCTt 7 residues of beta-galactosidase so that a hybrid protein is produced; PIN vectors (Van 
Heeke & Schuster Chem 264:5503-5509 1989)); PET vectors (Novagen, Madison Wis.); and 
the like. E^nression vectors fOT bacteria include the various components set forth above, and are well 
known m the art. Examples include vectors for Bacillus subtilis, E, coli. Streptococcus cremoris, and 
Streptococcus lividans, among othm. Bacterial esqjression vectors are transformed into bacterial host 
cells using techniques well known in die art, mich as calcium chloride mediated transfecticm, 
electroporation, and others. 

Expression in Yeast 

Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenuia potymorpha, KJuyveromyces 
fragilis and lactis, Pichia guillermondii and P pastoris, Schizosaccharomyces pombe, and 
YcoTowia lipolytica. Examples of suitable promoters for use in yeast hosts include the promoters for 
3-phosphoglycerate kinase (Hitzeman et al., J. BioL Chem. 255:2073 (1980)) or otiier glycolytic 
enzymes (Hess etal., J. Adv, Enzyme Reg. 7:149 (1968); Holland, Biochemistry 17:4900 (1978)), such 
as enolase, glyceraldehyde-3- phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
phosphofructokinase, glucose- 6-pho5phate isomerase, 3-phosphoglycCTate mutase, pyruvate kinase, 
triosephosphate isomerase, phosphoglucose isomerase, alpha fector, the ADH2IGAPDH promoter, 
glucokinase alcohol oxidase, and PGH. See, for exanqjle, Ausubel, et al., 1990; Grant et al.. Methods 
in Enzymology 153:516-544, (1987). Other yeast promoters, which are inducible have the additional 
advantage of transcription controlled by growth conditions, include the promoter regions for alcohol 
dehydrogenase 2, isocytochrome C, acid phosphatase, degradative eozymes associated with nitrogen 
metabolism, metallothionein, glyceraldefayde-3-phosphate dehydrogenase, and enzymes responsible 
for maltose and galactose utilization. Suitable vectors and promoters fcft use in yeast expression are 
further d^cribed in EP 73,657. Yeast selectable markers include ADE2. HIS4. LEU2. TRPl. and 
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ALG7, which confers resistance to tunicamyctn; the neomycin phosphotransferase geao, \vhich 
confCTS resistance to G418; and fee CUPl g^e, which allows yeast to grow in the presence of copper 
ions. Yeast expression vectors can be constructed for intracellular production or se<H^on of a CPP 
&om the DNA encoding the CPP of interest For exanq)le, a selected signal peptide and the 
appropriate constitutive or indxicible promoter may be inserted into suitable restriction sites in the 
selected plasmid for direct intracellular expression of the CPP, For secretion of the CPP, DNA 
encoding the CPP can be cloned into the selected plasmid, together with DNA mcoding the promoter, 
the yeast alpha-factor secretory signal/lead^ sequence, and link^ sequences (as needed), for 
expression of the CPP. Yeast cells, can then be transformed with the expression plasmids described 
above, and cidtured in an appropriate fermentation ihedia. Ihe protein produced by such transfoinied 
yeast can then be concentrated by precipitation with 10% trichloroacetic acid and analyzed following 
separation by SDS-PAGE and staming of the gels wi& Coomassie Blue stain. The recombinant CPP 
can subsequently be isolated and purified from the fermentation medium by techniques known to 
those of skiU in the art 



Expression in Mammalian Systems 

The CPP may be expressed in mammalian cells. Mammalian expression systems are known in 

transformed with any of a number of different viral-based expression systems, such as adenovirus, 
where the coding region can be Ugated into an adenovirus transcription/translation conqilex consisting 
of the late promoter and tripartite leader sequence. Insertion in a nonessmtial £1 or £3 region of the 
viral genome results in a viable virus capable of sqiression of the polypeptide of interest in infected 
host cells. A prefrared expression vector system is a retroviral vector system such as is generally 
described m PCTAJS97/01019 and PCT/US97/101048. Suitable m^rrmialia,! expression vectors 
contain a mammalian promote which is any DNA sequence capable of binding mammalian RNA 
polymoase and initiating the downstream (3 ') transcription of a coding sequence for CPP into 
mRNA. A promoter will have a transcription initiating region, which is usuaUy placed proximal to the 
5 ' ead of the coding sequence, and a TATA box, using a located 25-30 base pairs upstream of the 
transcription hritiation site. The TATA box is thought to direct RNA polymerase n to begm RNA 
syn&esis at the correct site. A mammaUan promoter will also contain an upstream promoter elonent 
(enhance- elemait), typically located within 100 to 200 base pairs upstream of the TATA box. An 
upstream promoter element determines the rate at which transcription is initiated and can act in either 
ori^itation. Of particular use as mammalian promoters are the promoters Scorn mammalian viral 
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genes, since flie viral genes are often hig^y expressed and have a broad host range. Examples include 
promoters obtained from tiie genomes of viru^s such as polyoma vinis» fowlpox virus (UK 2,21 1, 
504 published Jul. 5,1989)» adenovirus (such as Ad^ovirus 2), bovine papilloma virus, avian saicoma 
virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous 
mammalj an promot^ns, e.g., the actin promoter or an immunoglobulin promoter, and from heat-shock 
promoters, provided such promoters are compatible witii the host ceD systems. Transcription of DNA 
encoding a CPP by higjher eukaryotes may be increased by inserting an enhancer sequence into the 
vector, ^ihancers are cis-acting elements of DNA, usually about from 10 to 300 bp, that act on a 
promoter, to increase its transcription. Many enhancer sequences are now known from mammalian 
g^es (globin, elastase, albumin, a-fetoprotein, and insulin). Typically, however, one will use an 
enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer, the cytomegalovirus 
early promoter enhancer, ttie polyoma enhancer on the late side of the repHcation origin, and 
admovirus enhancers. Hie enhancer is preferably located at a site 5 ' from the promoter. In general, 
the transcription termination and polyadenylatton sequences recognized by mammalian cells are 
regulatory regions located 3' to the translation stop codon and thus, together with the promoter 
elements, flank the coding sequence. The 3 ' terminus of the mature mRNA is formed by site-specific 
post-translational cleavage and polyadenylation. Exanq^les of transcription terminator and 

rec ombin a nt proteins can be effected in a stable expression system. Expression vectors which contain 
viral origins of replication or endogenous expression elemaits and a selectable marker gene may be 
used for this purpose. Appropriate vectors containing selectable markers frir use in mammalian cells 
are readily available commercially and are known to persons skilled in the art. Exan^les of such 
selectable markers include, but are not limited to herpes sin]q)lex virus thymidine kinase and adenine 
phosphoribosyltiansferase for use in tk- or hprt-cells, respectively. The methods of introducing 
exogenous nucleic acid into mamnmlian hosts, as well as other hosts, is well known in the art, and 
will vary with the host cell used. Techniques include dextran-mediated transfection, calcium 
phosphate precipitation, polybrene mediated transfection, protoplast frision, electroporation, viral 
infection, encapsulation of the poiynucleotide(s) in liposomes, and direct microinjection of the DNA 
into nuclei. 

CPPs can be purified from culture supematants of mammalian cells transiently transfected or 
stably transformed by an ^qnression vector carrying a CPP-encoding sequence. Preferably, CPP is 
purified fcom culture supematants of COS 7 cells transiently transfected by the pcD expressim 
vector. Transfection of COS 7 cells witii pcD proceeds as follows: One day prior to transfection. 
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approximately 10^ COS 7 monkey cells are seeded onto individual 100 mm plates in Dulbecco's 
modified Eagle medium (DME)(xmtaining 10% fetal calf sraum and 2 mM Toperfoim 
the transfection, the medium is agitated from each plate and replaced with 4 ml of DME containing 
50 mM Tris JiQ pH 7.4, 400 mg/ml DEAE-Dextran and 50 \xg of plasmid DNA- The plates are 
incubated far four hours at 37®C, then tiie DNA-containing medium is removed, and the plates are 
washed twice with 5 ml of serum-free DME. DME is added back to the plates which are then 
incubated for an additional 3 hrs at 37^. The plates are washed once with DME, after which DME 
containing 4% fetal calf serum, 2 mM glutamine, peniciUin (100 U/L) and streptomycin (100 jig/L) at 
standard concentrations is added. The cells are tiien incubated for 72 hrs at 37°C, after which the 
growth medium is collected for purification of CPP. Plasmid DNA for the transfections is obtained 
by growing pcD(SRa), or like expression vector, containing the CPP-encoding cDNA insert in E. coli 
MC1061 (described by Casadaban and Coh^ J. Mol. BioL, Vol. 138, pgs. 179-207 (1980)), or like 
orgamsm. The plasmid DNA is isolated from the cultures by standard techniques, e.g. Sambrook et 
al.. Molecular Cloning: A Laboratory Manual, Second Edition (Cold Spring Harbor Laboratory, New 
York, 1989) or Ausubel et al (1990, cited above). 



O^Ps may also be produced in insect cells. Expression vectOTS for the transformation of insect 
cells, and in particular, baculovirus-based expression vectors, are well known in the art. In one such 
system, the CPP-encoding DNA is fused upstream of an epitope tag contained within a baculovirus 
expression vector. AutograpJia califomica nuclear polj^edrosis virus (AcNPV) is used as a vector to 
express foreign graes in Spodopterafmgiperda Sf9 cells ot in Trichoplusia larvae. The CPP-encoding 
sequence is cloned mto a nonessential region of the virus, such as the polyhedrin gene, and placed 
under control of the polyhedrin promoter. Successful insertion of a CPP-encoding sequence will 
render the pol>1iednn gene inactive and produce recombinant virus lacking coat protein coat Hie 
recombinant vniises are then used to infect S. frugiperda cells or Trichoplusia larvae in which the 
CPP is expressed (Smith et al„ J. Wol. 46:584 (1994); Engelhard E K et al., Proc. Nat Acad ScL 
91:3224-3227 (1994)). Suitable epitope tags for fusion to flie CPP-encoding DNA include poly-his 
tags and immunoglobulin tags (like Fc regions of IgG). A variety of plasmids may be employed, 
including commercially available plasmids such as pVL1393 (Novagen). Briefly, the CPP-encoding 
DNA or the desired portion of the CPP-CTcoding DNA is amplified by PCR with primers 
complCTisntaiy to the 5' and 3 ' regions. The 5 ' primer may incorporate flanking restriction sites. The 
PCR product is then digested witii the selected restriction eazymes and subcloned into an expr^sion 
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vector. Recoinbhiant baculovinis is g^^ated by co-transfecting the above plasmid and 
BaculoGoldTM virus DNA (Phamnngm) into Spodoptera frugiperda ("SS") cells (ATCC CRL 
1711) using lipofectin (conunercially available from GIBCO-BRL), or other methods knovm to those 
of skiU in the art Virus is produced by day 4-5 of culture in Sf9 cells at 28°C, and used for further 
amplifications. Procedures are paformed as farther described in O'Reilley et al., BACULOVIRUS 
EXPRESSION VECTORS: A LABORATORY MANUAL, Oxford University Press (1994). Extracts may 
be prqpared from recombinant virus-infected Sf9 cells as described in Riqjert et al^ Nature 362: 175- 
179 (1993). Altemadvely, expressed epitope-tagged CPP can be purified by affinity chromatography, 
or for exanq>le, purification of an IgG tagged (or Fc tagged) CPP can be performed using 
chromatography techniques, including Protein A or protein G column chromatography. 

Evaluation of Gene Ejq)ression 

Gctc expression may be evaluated in a sample directly, for exan:q>le, by standard techniques 
known to those of skill in the art, e,g.. Northern blotting to determine the transcription of mRNA, dot 
blotting (DHA or RNA), or in situ hybridization, using an appropriately labeled probe, based on the 
sequences provided herein. Alternatively, antibodies may be used in assays for detection of 
polypeptides, nucleic acids, such as specific diQ)lexes, including DNA duplexes, RNA duplexes, and 

earned out where the duplex is bound to a sur&ce, so that i^on the formation of dupl^ on the 
surface, tiie presence of antibody bound to the duplex can be detected. Gene ejqiression, alternatively, 
may be measured by immunohistochemical staining of cells or tissue sections and assay of cell culture 
or body fluids, to directly evaluate the expression of a CPP polypeptide or polynucleotide. Antibodies 
useful for such immunological assays may be either monoclonal or polyclonal, and may be prepared 
against a native sequence CPP. Protein levels may also be detected by mass speclxometiy. A fiirtiier 
m^hod of protein detection is with protein ch^. 

Purification of Expressed Protein 

£xpressed CPP may be purified or isolated after expression, using any of a variety of methods 
known to those skilled in the art The appropriate technique will vary depending upon what other 
components are present in the ample. Contaminant components that are removed by isolation or 
purification are materials that would typically intefi^ witii diagnostic or therE^}eutic uses for &e 
polypeptide, and may include oizymes, hormones, and other solutes. The purification step(s) selected 
will d^end, for example, on the nature of the production process used and the particular CPP 
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produced As CPPs are secreted, they may be recovered from culture medium. Alternatively, the 
CPP may be recovered from host cell lysates. If membrane-bound, it can be released from flje 
membrane using a suitable detergent solution (e.g. Triton-X 100) or by enzymatic cleavage. 
Alternatively, cells en^loyed in expression of CPP can be disrupted by various physical or chemical 
means, such as freeze-lhaw cycling, sonication, mechanical disruption, or by use of cell lysing agents. 
Exemplary purification methods include, but are not limited to, ion-exchange colimm 
chromatography; chromatography using silica gel or a cation-exchange resin such as DEAE; gel 
filtration using, for exanq)le, Sephadex G-75; protein A Sepharose columns to remove contaminants 
such as IgG; chromatograq3hy using metal chelating columns to bind epitope-tagged forms of the CPP; 
ethanol precipitation; reverse phase HPLC; chromatofocusing; SDS-PAGE; and ammonium sulfete 
precipitation, Oniinarily, an isolated CPP will be prepared by at least one purification step. For 
example, the CPP may be purified using a standard anti-CPP antibody column. Ultrafiltration and 
dialysis techniques, in conjunction with protein concentration, are also useful (see, for exan^le, 
Scopes. R., PROTEIN PURIFICATIOK Springer-Verlag, New Yrak, N.Y., 1982). The degree of 
purification necessary will vary depoiding on the use of the CPP. In some instances no purification 
win be necessaiy. Once expressed and purified as needed, the CPPs and nucleic acids of the present 
invention are useful in a ninnber of applications, as detailed herein. 

Transgenic animals 

The host cells of the invention can also be used to produce nonhuman transgenic animals. For 
exanQ>le, in one embodiment, a host cell of the invention is a fertilized oocyte or an embryonic stem 
cell into which CPP-coding sequences have been introduced. Such host cells can then be used to 
create non-human transgenic animals in which exogenous CPP sequences have been introduced into 
their genome or homologous recombinant animals in which endogenous CPP sequences have been 
altered. Such animals are useful for studying the function and/or activity of a CPP or fragment thereof 
and for identifying and/or evaluating modulators of CPP biological activity. As used herem, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a rodent such as a 
rat or mouse, in which one or more of the cells of the animal include a transgene. Other exanqjles of 
transgenic animals include non-human primates, sheep, dogs, cows, goats, chickens, amphibians, etc. 
A transgCTe is exogenous DNA which is integrated into the genome of a cell from which a transgeiic 
animal develops and which remains in the genome of flie mature anima:, thereby directing the 
expression of an encoded gme product in one or more cell types or tissues of the transgenic animal. 
As used herein, a "homologous recombinant animal" is a non-hinnan animal, preferably a mammal. 
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more preferably a mouse, in which an endogenous gene has been altered by homologous 
recombination between the endog^ous graie and an exogenous DKA molecule introduced into a cell 
of the animal, e.g., an embryonic ceU of the animal, prior to development of tiie animal. 

A transgenic animal of the invention can be cheated by introducing a CPP-encoding nucleic 
acid into the male pronuclei of a fertilized oocyte, e.g., by microinjection or retroviral infection, and 
allowing the oocyte to develop in a pseudopregnant female foster aniimd. The CPP cDNA sequence or 
a fragment diereof can be introduced as a transg^e into the genome of a non-human animal. 
Alternatively, a nonhuman homologi^ of a human CPP-CTCcding gene, such as &om mouse or rat, can 
be used as a Iransgene. !hitronic sequences and polyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of the tiansgene. A tissue-specific regulatory 
sequence(s) can be operabiy linked to a CPP transgene to direct expression of a CPP to particular 
cells. Methods for generating transgenic animals via embryo manipulation and microinjection, 
particularly animals such as nmce, have became convCTttional in the art and are described, for 
example, in U.S. Pat Nos. 4,736,866 and 4,870,009, both by Leder et aL, U.S. Pat No. 4,873,191 by 
Wagn^ et aL and in Hogan, B., Manipulating the Mouse Embryo, (Cold Spring Harb^ Laboratory 
Press, Cold Spring Harbor, N.Y., 1986, the disclosure of which is incorporated hmin by reference in 
its entirety). Similar methods are used for production of other transgenic anin^ls. A transgenic 

expression of CPP mRNA in tissues or cells of the animals. A transgCTic founder aninffll can ^ea be 
used to breed additional animals carrying the transgene. Moreov^, transgenic animals carrying a 
transgene encoding a CPP can further be bred to other transgenic animals carrying other transgenes. 

To create an animal in which a desired nucleic acid has been introduced into the genome via 
homologous recombination, a vector is prepared which contains at least a portion of a CPP^encoding 
sequence into which a deletion, addition or substitution has been introduced to thereby alter, e.g., 
functionally disrupt, the CPP-racoding sequence. The CPP-encoding sequence can be a human gene, 
but more preferably, is a non-human homologue of a human CPP-encoding sequence (e.g., a cDNA 
isolated by stringent hybridization with a nucleotide sequence coding for a CPP). For example, a 
mouse CPP-encoding sequmce can be used to construct a homologous recombination vector suitable 
for altering an mdogenous gene in the mouse genome. In a preferred embodiment, tl^ vector is 
designed such tiiat, vpon homologous recombination, the endogenous CPP-encoding sequence is 
functionally disrupted (i.e., no longo: encode a functional protein; also referred to as a "knock ouf ' 
vector). Alternatively, the vector can be designed such that, upon homologous reconibination, the 
endogenous CPP-mcoding sequ^tce is mutated or otherwise alt^ied but still encodes functional 
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protein (e.g., the upstream regulatory region can be ahdied to thereby alt^ Ihe expression of the 
endogenous CPP-encoding sequence). M the homologous recombination vector, the altered portion of 
the CPP-encoding sequ^ce is flanked at its 5' and 3' ends by additional nucleic acid sequence of the 
CPP gene to allow for homologous recombination to occur between the exogenous sequence carried 
by tiie vector and an endogenous gene in an embryonic stem cell. The additional flanlnng niicleic acid 
sequence is of sufficient length for successful homologous recombination with the endogenous gene. 
Typically, several kilobases of flanking DNA (both at the 5* and 3' ends) are included in tiie vector 
(see e,g., Thomas, K. IL and Capecchi, M. IL (1987) Cell 51:503, the disclosure of which is 
incorporated herein by refermce in its entirety, for a description of homologous recombination 
vectors). The vector is introduced into an embryonic stem cell line (e.g., by electroporation) and cells 
in which the introduced CPP-^coding sequence has homologously recombined with the endogenous 
gene are selected (see e.g., Li, E. et al. (1992) CeU 69:915, the disclosure of which is incorporated 
herein by reference m its entirety). The selected cells are then injected into a blastocyst of an animal 
(e.g., a mouse) to form aggregation chimeras (see e.g., Bradley, A. in Teratocarcinon^ and 
Embryonic Stem Cells. A Practical Approach, E. J. Rob^on, ed. (ERL, Oxford, 1987) pp. 1 13-152, 
the disclosure of which is incorporated herein by reference in its entirety). A chimeric embryo can 
then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to 

animals in which all cells of the animal contain the homologously recombined DNA by germline 
transmission of the iransgene. Methods for constructing homologous recombination vectors and 
homologous recombinant animals are described further in Bradley, A. (1991) Current Opinion in 
Biotechnology 2:823-829 and in PCT International Publication Nos.: WO 90/1 1354 by Le Mouellec 
et al.; WO 91/01 140 by Smithies et aL; WO 92/0968 by Zijlstra et al.; and WO 93/04169 by Bems et 
al., the disclosures of which are incorporated herein by reference in their entireties. 

In another embodiment, transgenic non-human animals can be produced which contain 
selected sjrstems which allow for regulated expression of the transgme. One example of such a 
system is the cre/loxP reconibinase system of bactmophage PI. For a description of the cre/loxP 
recombinase system, see, e.g., Lakso et al. (1992) PNAS 89:6232-6236, the disclosure of which is 
incorporated herein by reference in its entirety. Anotiaer example of a recombinase system is the FLP 
recombinase system of Saccharomyces cerevisiae (OGorman et al. (1991) Science 251:1351-1355, 
the disclosure of which is incorporated herein by reference in its entirety). If a cre/loxP recombinase 
system is used to regulate CTqn-ession of the transgene, animals containing transgenes mcoding botb 
the Cre recombinase and a selected protein are required. Such animaH can be provided throu^ the 
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constniction of "double" transg^c animals, e.g., by mating two transgenic animals, one containing a 
tiansgene ^coding a selected protein and the other containing a transgene encoding a recombinase. 

Assessing CPP activity 

It will be appreciated that the invention fiirfher provides methods of testing the activity of or 
obtaining fimctional fragments and variants of CPPs and CPP sequences. Such methods involve 
providing a variant or modified CPP-encoding nucleic acid and a^essing wheth^ the encoded 
polypeptide displays a CPP biological activity. Encompassed is thus a method of assessing the 
function of a CPP comprismg: (a) providing a CPP, or a biologically active fragment or homologue 
tbesreof; and (b) testing said CPP, or a biologically active fragment or homologue dtereof for a CPP 
biological activity under conditions suitable for CPP activity. Cell free, cell-based and in vivo assays 
may be used to test CPP activity. For example, said assay may comprise expressing a CPP nucleic 
acid in a host cell, and observing CPP activity in said cell and otha: affected cells, hi another 
example, a CPP, or a biologically active fragment or honK>logue thereof is contacted with a cell, and a 
CPP biological activity is observed. 

CPP biological activities include: (1) indicating that an individual has or will have a 
cardiovascular disorder, (2) circulating through the bloodstream of individuals with a cardiovascular 

flie ability to goierate an anti-CPP specific antibody; and for CPP 2: (5) int^acting with a CPP target 
protein, preferably a lipase; (6) stabilizing &e active site of a lipase; (7) increasing Kpase activity; (8) 
interacting with a CPP target molecule such as a phospholipid, micelle, or triglyceride; and (9) 
forming at least one disulfide bond; for CPP 9: (5) forming intramolecular ammo acid side chain 
interactions such as hydrogen, amide, or especially disulfide links; (6) interaction with a CPP target 
molecule, preferably an RNA molecule or virion (such as respiratory syncytial virus or RSV); (7) 
antiviral activity, and (8) hydrolysis of RNA phosphodiester bonds; for CPP 21: (5) fomiing 
intramolecular amino acid side chain interactions such as hydrog^ amide, or preferably disulfide 
link^ and (6) interacting with a CPP target molecule, in particular, cholesterol; for CPP 17: (5) 
forming intramolecular amino acid side chain interactions such as hydrogen, amide, or preferably 
disulfide Imks; (6) interaction witii a CPP target molecule, preferably a low-density lipoprotein or 
bactoial endotoxin; (7) neutralizing bacterial endotoxins; (8) promoting mast cell chemotaxis; (9) 
undergoing posttranslatioi^ processing, for example, specific proteolysis; (10) fimctioning as an 
antimicrobial defense; and (11) inhibitmg contraction of smooth muscle cells; and for CPP 20: (5) 
forming intramolecular amino acid side chain int^ctions such as hydrogen, amide, or preferably 
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disulfide links; (6) interacting with a CPP target molecule, in particular, a kringle domain-containing 
peptides such as plasminogen; and (7) reducing tumor growth. 

CPP biological activity can be assayed by any suitable mediod known in tiie art Antigenicity 
and inmmnogenicity nmy be detected, for example, as de^aibed in the sections titled "Anti CPP 
antibodies" and 'TJses of CPP antibodies". Circulatim in blood plasma may be detected as described 
in 'Diagnostic and Prognostic Uses". Interaction with a CPP target molecule may be detected 
according to any of the methods described herein, for exanQ>le, in the section titled 'T)rug Screening 
Assays**, 

Detemiining the ability of tite CPP to bind to or interact with a CPP target molecule can be 
accomphshed by a method for directly or indirectly determining binding, as is common to the art 
Such methods can be ceK-based (e.g., such that binding to a membrane-bound CPP is detected) or cell 
free. Interaction of a test compound with a CPP be detected, for example, by coupling the CPP or 
biologically active portion thereof with a label group such that binding of the CPP or biologically 
active portion (hereof to its cognate target molecule can be detramined by detecting the labeled CPP 
or biologically active portion thereof in a conq)lex. For exan^le, the extent of con^lex formation 
may be measured by inamunopiecipitating the complex or by performing gel electrophoresis. 
Determining the ability of the CPP to bind to a CPP target molecule may also be accomplished using a 

- • - - 

C. (1991) Anal. Chem. 63:2338-2345 and Szabo et al. (1995) Curr. Opin. Struct. Biol. 5:699-705, tiie 
disclosures of which are incorporated herein by reference in their entireties. As used herein, "BIA" is 
a technology for studying biospecific interactions in real time, without labeling any of the interactants 
(e.g., BIAcore). Changes in the optical phenommon of sur&ce plasmon resonance (SPR) can be used 
as an indication of real-time reactions between biological molecules. 

For CPP 2, Lipase activity may be determined by any assay capable of detecting the 
hydrolysis reaction catalyzed by lipase. Fen: example, a reduction in the amount of substrate or, 
altematively, an increase in the amount of product, in a test san^le can be measured. A preferred 
assay is described by Pante^iini, et al. (Ann Clin Biochem (2001) 38:365-370, disclosures of which 
are incorporated herein in its entirety). Briefly, a chromogenic substrate, DGGR, is provided in an 
assay mixture with lipase, bile salts, calcium, and colipase. Lipase activity on the substrate is 
measured by detecting the formation of methybresorufin, the chromophore product, at 580nnL Lipase 
activity is ideally measured to detect the CPP biological activities of stabili2dng the active site of a 
lipase and increasing lipase activity. An exan^le of such a metiiod comprises the steps of: contacting 
a biological sanqsle with an assay mixture of chromogmiic substrate and lipase under conditions 
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suitable for lipa^ activity; and detecting lipase activity. PrefCTbly, the biological sanaple is plasma, 
more preferably plasma fixnn an individual suspected of having a cardiovascular disorder. CPP 
activity is indicated by an increase in lipase activity when compared to an ^propriate negative 
control 

For CPP 9, RKase activity rmy be assayed by the following exemplary techniques. Enzyme is 
incubated 15 min at 37C in Buffer B (ImM reduced glutathione (Signm Chemical Co., St Louis, MO), 
0.5 mM oxidized ghitathione, 0.1 mM DTT, 0.5 mM Tris-HCl 8.2, 0.Ol mM H>TA, and O.IM urea) 
with lug yeast tRNA (Sigma) as a substrate. Electrophoresis is carried out in a 1.5% agarose gel m 
0.09 M Tris-phosphate, 0.002 M EDTA, and 0.5 microg/ml ethidium bromide. RNA is visualized by 
UV. RNase activity leads to tiie disappearance of the full-lmgtii RNA. Alternatively, RNase activity 
may be detected spectrophotometrically. Briefly, the concentration of perchloric acid-soluble 
ribonucleotides geuCTated from acid-insoluble yeast tRNA (Sigma) in a 40 mM sodium phosphate, pH 
73, buffer by a given quantity of RNase is measured spectrophotometricaUy at 260 nm. For example, 
an RNase-containing san^le conespcmding to approximately 100 nM recombinant RNase is added in 
a 0.8 ml reaction volume with 10 \il of 4 mg/ml yeast tRNA. increasing RNase activity results in 
reduced 260 nm values. 

For CPP 9, antiviral activity may be detected as follows. Briefly, CPP 9-containing sample at 

10^ infectious units/ml) in culture medium (Iscove's Modified Dulbecco's Medium witii 10% heat- 
inactivated fetal calf serum and 2 mM glutandne) and incubated with gratie rotation at room 
tenperature. After 2 h incubation, 200 microl of the suspension is used to infect target cells (human 
respiratory epithelial H£p-2 cells, in the case of respiratory syncytial virus) present in conflumt 
monolayers (3-4 x 10^ cells/monolayer) on coverslips within a one dram shell vial (Viromed, 
Minneapolis, MN). After spin anq)lification (700 g at 22*'C) and 16 h incubation (37°C, 5% CO2), the 
primary infected cells are identified by iimnunofluorescent staining (mouse anti-RS V blend, FTTC- 
labeled; Chraricon International, Temcncula, CA). Data is presented as infectious units i SD. 

For CPP 1 7, interaction with an LDL particle may be detected according to the methods of 
Higazi, et al. Blood 96:1393-1398 (2000), relevant disclosure of which is incorporated herein. Briefly, 
the peptides of interest were labelled with radioactive iodine. LrDL or a BSA control (lOnM) was 
incubated widi 0-5 uM ^^^I-defensin for Ihr at 37C. Hie mixture v^^as apphed to a polyaciylamide gel 
and the amount of radioactivity in the appropriate molecular weig^ fraction recorded. Specific LDL 
binding was defined by the increased amount of radioactive peptide bound to LDL compared to BSA. 
Antimicrobial activity may be detected according to any method known in the art, such as those 
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• described by Porter et al. (LifectJbnmim. 65: 2396-2401 (1997)). Briefly, L, monocytogenes EGD, E. 
coli ML35p, S, typMmurium 14028S and 79S3S, and C. albicans 820 are used. The bacterial cultures 
are harvested in nridgrowdi phase, washed, and resu^ended at a working dilution of 10^ bacteda/ml 
in 10 mM sodium phosphate (pH7.4), 1% Trypticase Soy Broth. C, albicans is harvested in tiie same 
manner but isolated in stationary phase. Colony Forming Unit (CFU) assajrs and Radial diffusion 
assays are performed as described in the presence of a test compound, a positive control compound, or 
no compound. A reduction in the value obtained for diher the CFU or radial diffusion assays in the 
presence of a test compound as conqsared to the control without cornpound indicates that the 
coiiq>ound has antimicrobial activity. 

For CPP 2, CPP 9, CP? 17, CPP 20 and CPP 21, intramolecular interactions may be detected 
by sequence-based structural predictions. Such predictions are generally based on X-ray 
crystallography or NMR structural data for a polypeptide with similar sequence. Detection of 
intramolecular interactions nmy also be accomplidied using SDS-PAGE. For the example of disulfide 
bonds, links formed between different portions of a given protein result in a more compacted protein, 
and tiius, a reduced apparent molecular weight Disulfide bonds may be disrupted by a reducing 
agent, for example, dithiothreitol (DTT). A protein sarnpie that has been treated with a reducing 
agsit may thus be coirg)ared to an untreated control by SDS-PAGE to detect a change in apparent 

For CPP 17, antimicrobial activity nay be detected according to any method known in the art, 
siwh as tiiose described by Porter et al. (Infect-ImmmL 65: 2396-2401 (1997)). Briefly, L. 
monocytogenes EGD, coli ML35p, S. typhimurium 14028S and 7953S, and C. albicans 820 are 
used. The bacterial cultures are harvested in irddgrowth phase, washed, and resuspended at a working 
dilution of 10^ bacteria/ml in 10 mM sodium phosphate (pH7.4), 1% Trypticase Soy Broth. C. 
albicans is harvested in the same manner but isolated in stationary phase. Colony Fomnng Unit 
(CFU) assays and Radial difiusion assays are performed as described in the presence of a test 
conqpound, a positive control con^oimd, or no conq^ound. A reduction in the value obtained for either 
the CFU or radial diffusion assa3r5 in the presence of a test compoimd as compared to tiie control 
without conqx)und indicates that the compound has antimicrobial activity. 

For CPP 1 7, specific proteolysis may be detected by comparing tiie molecular wei^t of a 
^mple peptide to that of a peptide of known molecular weight. Molecular weights are easily 
compared according to any mefliod conmion to the art such as SDS-PAGE, gel chromatography, or 
mass spectrometry. Preferably, the molecular weight of a test peptide is obtained by mass 
spectrometry and corrq>ared to a database co mpr i sin g molecular wei^its of peptides with 
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posttranslational modifications. Bx^plaiy databases include Genpept, SWISSPROT, EMBL, and the 
Protein Sequence Database. Such techniques are detailed further herein. 

For CPP 17, chemotactic activity may be assessed as described for T cells in U.S. Patent 
5837247. Briefly, lymphocyte migration was asse^ed using a 48 well microchemotaxis chamber 
(Neuro Probe Inc. Cabin John, Md.). 25 ul of the sample to be tested diluted in chemotaxis medium 
was placed in the Iowa- compartment and 50 ul of cell suspoision (at 5x 10^ cells/ ml) in the upper 
compartment The two compartments were separated by a polycarbonate filter (for T cells, 5 jim pore 
size, larger for oth^ lymphocytes) coated with 10 ug/ml collagen type IV ovCTiight at 4C. The 
apparatus was incubated at 37C for 3 h in humidified air with 5% CO2. At the end of the incubation 
period, the filter was removed, fixed and stained with LeukoStat (Fisher Scientific, Pittsburg Pa.). 
The number of cells that migrate through the filter w^e counted by h^t microscopy per high-power 
fields. Ihe results are oqjressed as the mean (±SD) value of the migration in tripUcate samples and 
are represmtative of at least three experiments. The statistical significance of the number of cells 
migrating in response to stimuh versus control medium was calculated using the Student's T test. For 
tbe m vivo assay, BALB/C and CB-17 scid/scid (SCID) mice were obtained from the Animal 
Production Area (NCI-FCRDC, Frederick, Md.). Mice were used at 8-12 wk of age and kept in 
pa&ogen fiw conditions. SCID mice wa-e treated with anti-ASGM-1 and injected with 1 x 10^ huPBL 

PBS were injected daily subcutaneously into the same injection site. The injection site was examined 
histologically eiflier at 4 hr after the first or at 24 hr by 4 hr after a second mjection. Experiments 
w^e performed on three to four mice per group in duplicate. 

For CPP 17, vascular smooth muscle contraction may be measured as described in Nassar, et 
al. (Blood 100:4026-32 (2002)) or EP0582631B1, relevant disclosures of which are incorporated by 
referCTce. Generally, aortic ring tissue sections from rat or another suitable animal are used for the 
assay. The tissue section rings are treated with, for exanq>le, epinqihiine, a powerful contractile 
stimulant and the lengths are measured in the presence and abssice of a test substance. An increased 
length of the section relative to the negative control indicates thsX a substance reduces contractile 
activity. 

For CPP 20, any method known in the art may be used to assay for tumor growth. For 
example, in the experiments detailed in W099/46282 (relevant disclosures of which are mcorporated 
herein by reference), C57B16/J mice were implanted with Lewis lung carcinomas. A suspension of 
10^ tumor cells in 100 microliters of PBS was injected into the subcutaneous dorsa of tiie mice. 
Tumors were measured with a dial-cahper, and volumes were determined using a general formula for 
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the volume of an effipsoid sphere (widtf x length x 0.52). The mice were randomized into two groups 
once the tumor volume reached about 160 mm3. One group was treated with the substance and 
the oths- received a PBS control. Tumor growth was monitored over 1 1 days. 

Cardiovascular disorders may be diagnosed by any method detranined qjpropriate for an 
individual by one of skill in the art. Further exanqjles of synqitoms and diagnostics may be found in 
the Bacl^ound section, and are best determined ^jpropriately by one of skill in the art ba^ on the 
particular profile of a patient. 

hitramolecular interactions may be detected by sequence-based structural predictions. Such 
predictions are gCTieraily based on X-ray crystallography or NMR structural data for a polypeptide 
with similar sequence. Detection of intramolecular interactions may also be acconqjlished using 
SDS-PAGE. For the example of disulfide bonds, links formed between different portions of a given 
protein result in a more contacted protein, and thus, a reduced apparent molecular weight. Disulfide 
bonds may be disrupted by a reducing agent, for exan^le, dithiotfareitol (DTT). A protein sample that 
has been treated with a reducing agent may thus be compared to an untreated control by SDS-PAGE 
to detect a change in apparent molecular weight Such methods are common to the art 



The present invaition provides antibodies and binding conqwsitions qjecific for CPPs. Such 
antibodies and binding conq)ositiQns include polyclonal antibodies, monoclonal antibodies. Fab and 
single chain Fv fiagments thereof, bispecific antibodies, heteroconjugates, and humanized antibodies. 
Such antibodies and binding compositions may be produced in a variety of ways, mcluding hyfaridoma 
cultures, recombinant esqiression in bacteria or mammalian cell cultures, and recombmant ejqjression 
in transgenic animals. There is abundant guidance in the Uterature for selecting a particular 
production methodology, e.g. Chadd and Chamow, Curr. Opin. Biotechnol., 12: 188-194 (2001). 

The choice of manufacturing methodology depends on several fectors includmg the antibody 
stnicture desired, the inqjortance of caibohydrate moieties on the antibodies, ease of culturing and 
purification, and cost. Many different antibody structures may be generated using standard 
expression technology, inchiding fiill -length antibodies, antibody fiagments, such as Fab and Fv 
fiagments, as well as chimeric antibodies comprising conqjonents fi-om different species. Antibody 
fiagments of smaU size, such as Fab and Fv fi-agmoits, having no effector fimctions and limited 
pharmokinetic activity may be generated in a bactmal expression system. Single chain Fv fiagments 
are highly selective for in vivo tumors, show good tumor penetration and low mmumogenicity, and 
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are cleared tapidly from the blood, e.g. Freyre et al, J. Biotecfanol., 76: 157-163 (2000). Thus, such 
molecules are desirable for radioimmunodetection. 

Polyclonal Antibodies 

The anti-CPP antibodies of the present invention may be polyclonal antibodies. Such 
polyclonal antibodies can be produced in a mammal, for example, following one or more injections of 
an immunizEng agent, and preferably, an adjuvant Typically, the immuni^g agent and/or adjuvant 
will be injected into the mammal by a series of subcutaneous or intraperitoneal injections. The 
immiinizing agent may include CPPs or a fusion protein thereof. It may be useful to conjugate the 
antig^ to a protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include, but are not limited to, keyhole linnet hemocyanin (KLH), methylated 
bovine serum albimun (mBSA), bovine serum albumin (BSA), Hepatitis B surface antigen, serum 
albumin, bovine thyroglobulin, and soybean tiypsin inhibitor. Adjuvants include, for exanq)le, 
Freund's conq>lete ac^uvant and MPL-TDM adjuvant (monophosphoiyl Lipid A, synthetic trehalose 
dicoiyno-n:Q^late). The immimization protocol may be determined by one skills in the ait based on 
standard protocols or by routine experimentation. 

Alternatively, a crude protein preparation which has been enriched for a CPP or a portion 

into the iKm-faun:ian mammal in the presoice of an appropr iate adjuvant If the serum contains 
polyclonal antibodies to undesired ^topes, the polyclonal antibodies are purified by ixmnunoaflHnity 
chromatography. 

Effective polyclonal antibody production is affected by many factors related both to the 
antigeo and the host species. Also, host animals vary in response to site of inoculations and dose, with 
both inadequate and exce^ve doses of antigen resulting in low titer antisera. Small doses (ng level) 
of antigen administered at multiple intradermal sites ^ypear to be most reliable. Techniques for 
producing and processing polyclonal antisera axe known in the art, see for exanqile, Mayer and 
WaDoer (1987), the disclosure of which is incorporated herein by reference in its entirety. An effective 
immunization protocol for rabbits can be found in Vaitukaitis, J. et al. J. Clin. Endocrinol. Metab. 
33:988-991(1971), the disclosiffe of which is incorporated by reference in its entirety. Boosts 
injections can be givm at regular inten^ls, and antiserum harvested when antibody titer thereof as 
determined semi-quantitatively, for exan^le, by double immunodif&ision in agar against known 
concentrations of the antigen, begins to fall. See, for example, Ouchterlony, O. et al.. Chap. 19 in: 
Handbook of Ejjperimental Immunology D. Wier («i) Blackwell (1973). Plateau concentration of 
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antibody is usually in the range of 0 . 1 to 0^ mg/ml of serum. Af&nity of the antisera for the antigen i s 
detCTidned by preparing competitive binding curves, as described, for exanqile, by Fisher, D., Chap. 
42 in: Manual of Qinical Immunology, 2d EA (Rose and Friedman, Eds.) Amer. Soc. For Microbiol., 
Washington, D. C. (1980). 

Monoclonal Antibodies 

Alternatively, the anti-CPP antibodies may be monoclonal antibodies. Monoclonal antibodies 
may be produced by hybridomas, wherein a mouse, hamster, or other appropr iate host animal, is 
imm uniz e d wifli an immunizing agent to elicit lymphocytes that produce or are capable of producing 
antibodies that will specifically bind to the immunizing agent, e.g. Kohler and Miistein, Nature 
256:495 (1975). The immunizing agent will typically include tiie CPP or a fusion protein thereof and 
optionally a carrier. Alternatively, the lymphocytes may be nnmunized in vitro. Gently, spleen 
cells or lymph node cells are used if non-human mammalian sources are desired, or periphe^ blood 
lyn^hocytes CTBLs") are used if cells of human origin are desired. Hie lyn^hocytes are fused with 
an immortalized cell line using a suitable fiising agent, such as polyethylene glycol, to produce a 
hybridoma cell, e.g. Goding, MONOCLONAL ANIWODIES: PRINCIPLES AND PRACTICE, 
Academic Press, pp. 59-103 (1986); Uddeil and Oyer, A Practical Guide to Monoclonal Antibodies 

Press, New York, 1994). In general, irmnortahzed cell lines are transfonned mammalian cells, for 
exanqjle, inyeloma cells of rat, mouse, bovine or human origin. The hybridoma cells are cultured in a 
suitable culture medium that preferably contains one or more substances that inhibit the growth or 
survival of unfused, immortalized cells. For example, if the parental cells lack the enzyme 
hypoxanlhine guanine phosphoribosyl transferase (HGPRT), the culture medium for the hybridomas 
Really will include hypoxanthine, aminopterin, and fliymidine (HAT), substances which prevent the 
gn)w& of HGPRT-deficirat cells. Preferred immOTtalized cell lines are those that fuse efficiently, 
si^ort stable high level production of antibody, and are sensitive to a medium such as HAT medium. 
More preferred immortalized cell lines are murine or human myeloma lines, which can be obtained, 
for exanqjle, from the American Type Culture Collection (ATCC), Rockville, MD. Human myeloma 
and mouse-human heteromyeloma cell lines also have been described for the production of human 
monoclonal antibodies, e.g. Kozbor, J. hnmunoL 133:3001 (1984); Brodeur et al.. Monoclonal 
Antibody Production Techniques and Applications, Marcel Dekker, Inc., New York, pp. 51-63 (1987). 

The culture medium (supernatant) in vdiich the hybridonm cells are cultured can be assayed 
for the presence of monoclonal antibodies directed against a CPP. Preferably, the binding specificity 
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of monoclonal antibodies pres^ in the hybridoma supernatant is detenxiined by imnninoprecipitation 
or by an in vitro binding assay, such as radio- innnunoas^y (RIA) or Enzyme-Linked hnmuno 
Sorbent Assay (ELISA). Appropriate techniques and as^ys are known in die art The binding afiBnity 
of tiie monoclonal antibocfy can, for example, be determined by the Scatchard analysis of Munson and 
Pollard, AnaL Biochem. 107:220 (1980). Afte the desired antibody-producmg hyteidoma cells are 
idmtified, the ceUs may be cloned by limiting dilution procedures and grown by standard methods 
(Goding, 1986, si^ra). Suitable culture media for this purpose include, for exan^le, Dulbecco's 
Modified Eagle's Medium and RPMI-1640 medium Alternatively, the hybridoma cells may be grown 
in vivo as ascites in a manmial. The monoclonal antibodies secreted by selected clones may be 
isolated or purified from the culture medium or ascites fluid by immunoglobulin purification 
procedures routh^ly used by those of skill in the art such as, for exan^le, protein A-S^han>se, 
hydro^grl-apatite chromatogfE^hy, gel electrophoresis, dialysis, or a£6nity chromatography. 

Hie monoclonal antibodies may also be made by recombinant DNA me&ods, such as those 
described in U.S. Pat No. 4,816,567. DNA ^coding the monoclonal antibodies of fee invention can 
be isolated from the CPP-specific hybridoma cells and sequenced, e.g., by using oHgcmucleotide 
probes that are capable of binding specifically to genes encoding the heavy and Ught chains of murine 
antibodies. Once isolated, the DNA may be inserted into an expression vector, which is then 



cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of ixKinoclonal 
antibodies in the recombinant host cells. The DNA also may be modified, for example, by substituting 
the coding sequence for the murine heavy and light chain constant domains for the homologous 
human sequences (Morrison et al., Proa Nat Acad ScL 81:6851-6855 (1984); Neuberg^ et al.. 
Nature 312:604-608 (1984); Takeda et al., Nature 314:452-454 (1985)). or by coval«itly joining to 
the immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin 
polypeptide. The n(»i-immunoglobulin polypeptide can be substituted for the constant domains of an 
antibody of the invention, or can be substituted for the variable domains of one antigen-combining 
site of an antibody of tiie invention to create a chimeric bivalent antibody. The antibodies may also be 
monovalmt antibodies. Methods for preparing monovalent antibodies are well known in the art For 
exan^le, in vitro methods are suitable for preparing monovalent antibodies. Digestion of antibodies to 
produce fragments thereof particularly. Fab fragments, can be accon^lished using routine techniques 
known in the art. 

Antibodies and antibocfy friagments characteristic of hybridomas of the invraition can also be 
produced by recombinant means by extracting messraiger RNA. constructing a cDNA library, and 
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selecting closies ^liich encode segments of the antibody molecule. The following are cxessnplary 
TQ^r&xcGS disclosing recombinant techniques for producing antibodies: Wall et al.. Nucleic Acids 
Research, Vol. 5, pgs. 31 13-3128 (1978); Zakut et al.. Nucleic Acids Research, VoL 8, pgs. 3591- 
3601 (1980); Cabilly et al., Proc. Natl. Acad Sci., Vol. 81, pgs. 3273-3277 (1984); Boss et al.. 
Nucleic Acids Research, VoL 12, pgs. 3791-3806 (1984); Amster et al.. Nucleic Acids Research, VoL 
8, pgs. 2055-2065 (1980); Moore et al., U.S. Patrat 4,642,334; Skerra et al. Science, Vol. 240, p^. 
1038-1041(1988); Kise et al, Scimce, VoL 246. pgs, 1275-1281 (1989); and U.S. patenis 6,054,297; 
5,530,101; 4,816,567; 5,750,105; and 5,648,237; which patents are incorporated by ref^^ce. In 
particular, such techniques can be used to produce interspecific monoclonal antibodies, whmin the 
binding region of one ^>ecies is combined with non-binding region of the antibody of another species 
to reduce irmnunogemcity, e.g. Liu et al., Proc. Natl. Acad, Sci., Vol. 84, pgs. 3439-3443 (1987), and 
patents 6,054,297 and 5,530,101. Preferably, reconibinantly produced Fab and Fv fragmoits are 
expressed in bacterial host systems. Prefimbly, full-loigtii antibodies are produced by mammalian 
cell culture techniques. More preferably, full-length antibodies are expressed in Chinese Hamster 
Ovaiy (CHO) cells or NSO cells. 

Both polyclonal and monoclonal antibodies can be screened by ELISA. As in other solid 
phase immunoassays, the test is based on tiie tendency of macromolecules to adsorb nonspecifically to 

formation of antigoi-antibody conq>lexes with a simple separation of such coitqilexes from unbound 
mat^al. To titrate anti-pq>tide serum, peptide conjugated to a carrier different from that used in 
immunization is adsorbed to the wells of a 96-well microtiter plate. The adsorbed antigen is then 
allowed to react in the wells with dilutions of anti-peptide sramn. Unboimd antibody is washed away, 
and the r em a inin g antigen-antibody complexes are allowed to react with an antibocfy specific for the 
IgG of the immunized animal. This second antibody is conjugated to an en2yme such as alkaline 
phosphatase. A visible colored reaction produced wbeaa. the eazyme substrate is added indicates 
which wells have bound antipeptide antibodies. The use of spectrophotomet^ readings allows better 
quantification of the amount of peptide-specific antibody bound. Hig}i-titer antisera 3deld a linear 
titration curve between 10**^ and 10~^ dilutions. 

CPP peptide carriers 

The invention include immunogens derived from CPPs, and immunogens ccnnprising 
conjugates between carriers and p^tides of the invention. The term immunogen as used herein refers 
to a substance which is capable of causing an immune response. The term carrier as used herein 
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refers to any substance which when chemically conjugated to a peptide of the invention permits a host 
organism immunized with the resulting conjugate to generate antibodies specific for the conjugated 
peptide. Carriers include red blood cells, bacteriophages^ proteins, or synthetic particles such as 
agarose beads. Prefo-abiy, carriers are proteins, such as serum albumin, gamma-globulin, lasyhole 
limpet hemoc3^anin (KIH), thyroglobulin, ovalbumin, fibnnogen, or the like. 

The gmeral technique of Hnking synthetic peptides to a carrier is desmbed in several 
references, e.g. Walter and Doolittle, "Antibodies Against Synthetic Peptides," in Setlow et al., eds.. 
Genetic Engineaing, Vol, 5, pgs. 61-91 (Plenum Press, N.Y^ 1983); Green et al. Cell, Vol. 28, pgs. 
477-487 (1982); Lemer et al., Proc. Natl. Acad, Sci., Vol. 78, pgs. 3403-3407 (1981); Shinrizu et al., 
U.S, Patrat 4,474,754; and Ganfield et al., U.S. Patent 4,3 1 1,639. Accordin^y, these references ai« 
incoiporated by reference. Also, techniques employed to link haptras to carriers are essentially the 
same as the above-refCTenced techniques, e.g. chapter 20 in Tijssen, Practice and Theory of l&izyme 
Imnnmoassays (Elsevier, New York, 1985). The four most commonly used schemes for attaching a 
peptide to a carrier are (1) glutaraldehyde for amino coupling, e.g, as disclosed by Kagan and Gbck, 
in Jaffe and Behrman, eds. Methods of Hormone Radioimmunoassay, pgs. 328-329 (Academic Press, 
N.Y., 1979), and Walter et al. Proc. Natl. Acad. Sci., Vol. 77, pgs. 5197-5200 (1980); (2) water- 
soluble carbodiimides for carboxyl to amino coupling, e.g. as disclosed by Hoare et al., J. Biol. 

sidechain coupEng, e.g. as disclosed by Bassin et al., pgs. 46^7, in Jaffe and Behrman, eds. (cited 
above), and Walter et al. (cited above); and (4) maleimidobenzoyl-N-hydro^Q^ccinimide ester 
(MBS) for coupling cysteine (or otiier sulfhydryls) to amino groups, e,g. as disclosed by Kitagawa et 
al., J. Biochem. (Tolgro), Vol. 79, pgs. 233-239 (1976), and Lemer et al. (cited above). A genaal rule 
for selecting an appropriate method for coupling a given peptide to a protein carrier can be stated as 
follows: the group involved in attachment should occur only once in the sequence, preferably at the 
apjwopriate €aid of tiie segnwnt For example, BDB should not be used if a tyrosine residue occurs in 
the main part of a sequence chosen for its potentially antigenic character. Similarly, centrally located 
lysines rule out the glutaraldehyde me&od, and the occurrences of aspartic and glutamic acids 
frequently exclude the carbodiimide approach. On the other hand, suitable residues can be positioned 
at either end of chosen sequence segment as attachment sites, whether or not they occur in the 
"native" protein Mquence. hitemal segments, unlike the amino and carboxy termini, will differ 
significantly at the ''unattached end" from the same sequence as it is found in the native protein where 
the polypeptide backbone is continuous. The problem can be remedied, to a degree, by acetylating the 
a-amino group and then attaching the peptide by way of its carboxy terminus. The coi^ling 
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efficioicy to the carrier protein is conv^ently measured by using a radioactively labeled peptide, 
prepared either by using a radioactive amino acid for one step of the synthesis or by labeling the 
completed peptide by the iodimtion of a tyrosine residue. The presence of tyrosine in the peptide also 
allows one to set up a sensitive radioimmune assay, if desirable. Therefore, tyrosine can be 
introduced as a terminal residue if it is not part of the peptide sequence defined by the native 
polypeptide. 

Preferred carrim are proteins, and prefenred protein carders include bovine serum albumin, 
myoglobulin, ovalbumhi (OVA), keyhole lin^t hemocyanin (KLH), or fte like. Peptides can be 
hnked to KLH through cysteines by MBS as disclosed by Liu et al., Biochemistry, Vol. 1 8, pgs. 690- 
697 (1979). The peptides are dissolved in phosphate-buffered salme (pH 7.5), 0.1 M sodium borate 
buffer.(pH 9.0) or 1.0 M sodium acetate bufifer (pH 4.0). The pH for the dissolution of the peptide is 
chosen to optimize peptide solubility. The content of finee cysteine for soliible peptides is determined 
by EUman's method, Ellman, Arch. Bioohem. Biophys., Vol. 82, pg. 7077 (1959). For each peptide, 4 
mg KLH in 0.25 ml of 10 mM sodium phosphate buffer (pH 7.2) is reacted with 0.7 mg MBS 
(dissolved in dimetiiyl foimamide) and stirred for 30 mfn at room tenqsQ-ature. Hie MBS is added 
dropwise to ensure that the local concentration of formamide is not too high, as KLH is insoluble in 
>30% fomfflmide. ITie reaction product, KLH-MBS, is ten passed through Sephadex G-25 

peak fitu:tions of fiie column eluate (monitored by OD280) is estimated to be ^^proximately 80%. 
KLH-MBS is tiien reacted with 5 nig peptide dissolved in 1 ml of the chosen buffer. The pH is 
adjusted to 7-7.5 and the reaction is stirred for 3 hr at room tempCTature. Coupling efBciency is 
monitored with radioactive peptide by dialysis of a san:iple of the conjugate against phosphate- 
buffered saline, and may range from 8% to 60%. Once the peptide-carrier conjugate is available, 
polyclonal or monoclonal antibodies are produced by standard techniques, e.g. as disclosed by 
Campbell, Monoclcmal Antibody Technology (Elsevier, New York, 1984); Hurrell, ed. Monoclonal 
Hybridoma Antibodies: Techniques and Applications (CRC Press, Boca Raton, FL, 1982); Schreier 
et al. Hybridona Techniques (Cold Spring Harbor Laboratory, New York, 1980); U.S. Patent 
4,562,003; or the like. In particular, U.S. Patent 4,562,003 is incorporated by reference. 

Htimamzed Antibodies 

The anti-CPP antibodies of the invention may further comprise humanized antibodies or 
human antibodi^. The term "humanized antibody" refers to humanized forms of non-human (e.g., 
murine) antibodies that are chimeric antibodies, immunoglobulin chains or fragmraits thereof (such as 
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Fv, Fab, Fab% F(ab'), or otiter antigen-binding partial sequences of antibodies) which contain some 
portion of the sequoice derived from non-hunsan antibody. Humanized antibodies include human 
immunoglobulins in which residues fiiom a complem^tary det^mining region (CDR) of the human 
immunogiobulin are replaced by residues &om a CDR of a non-human species such as mouse, rat or 
rabbit having the desired binding specificity, affinity and capacity. In general, the humanized antibocty 
will comprise substantially all of at least one, and generally two, variable domains, in which all or 
substantially all of the CDR regions correspond to those of a non-human immunoglobulin and all or 
substantially all of tiie FR regions are those of a human immunoglobulin consensus sequence. The 
humanized antibody optimally also will comprise at least a portion of an immunoglobulin constant 
region (Fc), typically that of a human immunoglobulin (Jones et al., Nature 321:522-525 (1986) and 
Presta, Ctar. Op. Struct. Biol 2:593-596 (1992)). Methods for humaniang non-hun^ antibodies are 
well known in the art. Generally, a humanized antibody has one or more amino acids introduced into 
it from a source which is non-hinnan in order to more closely resemble a human antibody, while still 
retaining the original binding activity of the antibody. Methods for humanization of antibodies are 
further detailed in Jones et al.. Nature 321 :522-525 (1986); Riechmann et al.. Nature 332:323-327 
(1988); and Verhoeyoi et al.. Science 239:1534-1536 (1988). Such "humanized" antibodies are 
chimeric antibodies in that substemtiaUy less Hasai an intact human variable domain has been 

Heteroconpigate Antibodies 

Heteroconjugate antibodies which con^rise two covalently joined antibodies, are also within 
the scope of the present invention. Heteroconjugate antibodies may be prepared in vitro using known 
methods in syn&etic protein chemistry, including those involving crosslinking agents. For exanq)le, 
immunotoxins may be prepared using a disulfide exctmnge reaction or by forming a ttiioether bond. 

Bispedfic Antibodies 

Bispecific antibodies have binding specificities for at least two differ^t antigens. Such 
antibodies are monoclonal, and preferably human or hiunanized. One of the binding specificities of a 
bispecific antibody of the presoat invention is for a CPP, and die other one is preferably for a cell- 
surface protein or recq^tor or receptor subunit Methods for nsildng bispecific antibodies are known 
in tiie art, and in general, the recombinant production of bispecific antibodies is based on die co- 
e?q}re^ion of two immunoglobulin heavy-chain/light-chain pairs in hybridoma cells, where the two 
heavy chains have diffCTKit ^ecificities, e.g. Milstein and Cuello, Nature 305:537-539 (1983). Given 
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• that the random a^ortment of immunoglobulm heavy and li^t chains results in production of 
potentially ten different antibody molecules by the hybridomas, purification of the correct nu>lecule 
usually requires some sort of afSnity purification, e.g. afBnity chromatography. 

Uses of CPP antibodies 

CPP antibodies are preferably specific for the CPPs of the invention and, as such, do not bind 
peptides derived from other proteins with high afimify. As used herein, the term "h^vy chain 
variable region" means a polypeptide (1) which is fix>m 1 10 to 125 amino acids in length, and (2} 
who^ amino acid sequence corresponds to that of a heavy chain of an antibody of tiie invention, 
starting fiom the heavy chain^s N-terminal amino acid. Likewise, die term "light chain variable 
region" means a polypeptide (1) which is fnsm 95 to 1 15 amino adds in length, and (2) whose amino 
acid sequence conesponds to that of a light clmin of an antibody of the invention, starting fi-om the 
light chain's N-tmninal amino acid. As used herein the term "monoclonal antibody" refers to 
homogeneous populations of immunoglobulins vAnoh are enable of specifically binding to CPPs. 

CPP antibodies may be used as fimctional modulators, preferably as antagonists. Preferably, 
antibody modulators of liie invention are derived fi'om monoclonal antibodies speci&c for CPPs. 
Monoclonal antibodies enable of blocking, or neutralizing, CPPs are selected by their ability to 

The use of antibody fiagments is also well known, e.g. Fab fi*agments: Tijssen, Practice and 
The<ny of Enzyme Immunoassays (Blsevier, Amsterdam, 1985); and Fv firagments: Hochman et aL 
Biochemistry, Vol. 12, pgs. 1 130-1 135 (1973). Sharon et al.. Biochemistry, Vol. 15, pgs. 1591-1594 
(1976) and Efariich et al., U.S. Patent 4^55,023; and antibody half molecules: Auditore- Hargreaves, 
U.S. Patent 4,470,925. 

Preiembly, monoclonal antibodies, Fv fiagnrats. Fab fiagments, or oth^ binding 
con^TOsiticms derived icom monoclonal antibodies of the invention have a hig^ afiSnity to CPPs. The 
afBnity of n:u)noclonal antibodies and related molecules to CPPs may be measured by convCTtional 
techniques including plasmon resonance, ELISA, or equilibrium dialysis. Affinity measuremoit by 
plasmon resonance techniques may be carried out, for example, using a BIAcore 2000 instrument 
(Biacoxe AB, U^^sala, Swedra) in accordance with ^e manufacturer's recommended protocol. 
Preferably, affiidly is measured by ELISA, as described in U.S. patent 6,235,883, for example. 
Preferably, the dissociation constant between CPPs and mcmoclonal antibodies of the invention is less 
than 10'^ molar. More prefmbly, such dissociation constant is less than 10'^ molar; still more 
pref^ably, such dissociation constant is less than 10"^ molar; and most preferably, such dissociation 
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constant is in tlie range of 10'^ to 10'*' molar. 

In addition, the antibodies of the present invention are use£il for detecting CPPs. Such 
detection methods are advantageously applied to diagnosis of cardiovascular disorders, in particular, 
coronary artery disease. The antibodies of the inv^ition may be used in most assays involving 
antigra-antibody reactions. The assays may be homogeneous or heterogeneous. In a homogeneous 
assay approach, tiie saxrple can be a biological sample or fluid such as serum, urine, whole blood, 
lymphatic fhrid, plasma, saliva, cells, tissue, and material secreted by cells or tissues cultured in vitro. 
The sample can be pretreated if necessary to remove unwanted materials. The immunological ruction 
usually involves the specific antibocfy, a labeled analyte, and the sample suspected of containing the 
antigen. The signal arising from the label is modified, directly or indirectly, i^n the binding of the 
antibody to the labeled analyte. Both immunological reaction and detection of the extent thereof are 
carried out in a hon:u>g^eous solution; Immunochemical labels which may be employed include fi^e 
radicals, fluorescent dyes, enz^es, bact^ophages, comzymes, and so fordt 

In a heterogeneous assay approach, the reagents are usually the sanqjle, the specific antibody, 
and means for producing a detectable signal. Ihe specimm is generaUy placed on a siqjport, such as a 
plate or a slide, and contacted with the antibody in a hquid phase. The st^port is thai separated j&om 
the liquid phase and either the support phase or the liquid phase is examined for a detectable signal 

presmce of the antigen in tte sample. Means for producing a detectable signal include the use of 
radioactive labels, fluorescent compounds, enzymes, and so forth. Exemplary hetwogeneoxis 
immunoassays are the radioimmunoassay, immunofluorescence methods, en2yme-linked 
immunoassays, and the like. 

For a more detailed discussion of the above inmnmoassay techiuques, see "Enzyme- 
Inmmnoassay," by Edward T. Ma^o, CRC Press, hic, Boca Raton, Fla., 1980. See also, for 
example, U.S. Pat. Nos. 3,690,834; 3,791,932; 3,817,837; 3,850,578; 3,853,987; 3,867,517; 
3,901,654; 3,935,074; 3,984,533; 3,966,345; and 4,098,876, which listing is not intmded to be 
exhaustive. Methods for conjugating labels to antibodies and antibody fragmmts are well known in 
the art Such methods may be found in U.S. Pat. Nos. 4,220,450; 4,235,869; 3,935,974; and 
3,966,345. Another example of a technique in which the antibodies of the invration may be employed 
is immunoperoxidase labeling. (Stemberger, Immunocytochemistry (1979) pp. 104-169). 
Alternatively, the antibodies may be bound to a radioactive material or to a drug to form a 
radiopharmaceutical or pharmaceutical, respectively. (Carrasquillo, et al., Canc^ Treatment Reports 
(1984)68:317-328). 
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vesnt of an assay en^loying an antibody of the present uxv^tion involves fhe use 
of a sur&ce to winch the monoclonal antibody of the invention is attached. The underl3ring structure 
of the sur&ce may take different forms, have different conq)05itions and may be a mixture of 
con9}ositions or laminates or combinations thereof The sur&ce may assume a variety of shapes and 
forms and may have varied dimensions, depending on &e mann^ of use and measurement. Illustrative 
sur&ces may be pads, beads, discs, or strips which may be flat, concave or convex. Thickness is not 
critical, gmerally being from about 0.1 to 2 mm thick and of any cmvouent diameter or oth^ 
dimensions. The sur&ce typically will be supported on a rod, tube, c^illary, fiber, strip, disc, plate, 
cuvette and will typically be porous and polyfimctional or capable of being polyfunctionalized so as 
to permit covalent binding of an antibody and permit bonding of other compounds which form a part 
of a means for producmg a detectable signal. A wide variety of organic and inorganic polymers, both 
natural and synthetic, and combinations thereof, may be employed as the nraterial for the solid 
sur&ce. Illustrative polymo-s include polyethylene, polypropylene, poly(4-methylbutene), 
polystyrme, polymethraciylate, poly(ethylene t^^hlhalate), rayon, nylon, poly(vinyl butyrate), 
silicones, polyformaldehyde, cellulose, cellulose acetate, nitrocellulose, and latex. Other surfaces 
include paper, glasses, ceramics, metals, metaloids, semiconductor materials, cem^ts, silicates, or die 
like. Also included are substrates that form gels, gelatins, lipopolysaccharides, silicates, agarose and 



glycols Calkylene of 2 to 3 carbon atoms) or surfactants such as phospholipids. The binding of the 
antibody to die surface may be acconq>lished by well known techniques, commonly available in die 
literature (see, for exan^le, 'Immobilized En^mjes," Ichiro Chibata, Press, New York (1978) and 
Cuatrecasas, J. Bio. Chem., 245: 3059 (1970)). In carrying out the assay in accordance with this 
aspect of the invention, the sample is mixed with aqueous medium and the medium is contacted with 
the sur&ce having an antibody bound thereto. Labels n^y be included in die aqueous medium, eidier 
concunratty or added subs6q*«^tly so as to provide a detectable signal associated with the sur&ce. 
The means for producing the detectable signal can involve the incorporation of a labeled anafytQ or it 
may involve the use of a second monoclonal antibody having a label conjugated thereto. Separation 
and washing steps will be carried out as needed. The signal detected is related to the presence of CPP 
in the sanq>le. It is within the scope of die present invention to include a calibration on the same 
support A particular embodiment of an assay in accordance with the presmt invention, by way of 
illustration and not limitation, involves the use of a support such as a slide or a well of a petn dish. 
The technique involves fixing die saxnplc to be analyzed on the siqiport with an appropriate fixing 
material and incubating the sample on the slide with a monoclonal antibody. After washing with an 
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appropriate buffer such as, for exaosple, phosphate buffered saline^ the support is contacted with a 
labeled specific binding partner for the antibody. After incubation as desired, the slide is wasted a 
second time with an aqueous buffer and the determination is made of the binding of die labeled 
monoclonal antibody to the antigen. If the label is fluorescent, the slide may be CGveied with a 
fluorescent antibody mounting fluid on a cover slip and then examined with a fluorescent microscope 
to determine the extent of binding. On the other hand, the label can be an enzyme coi^'ugated to the 
monoclonal antibody and the extent of binding can be determined by examining the shde for the 
presence of enzyme activity, which may be indicated by the formation of a precipitate, color, etc. A 
particular example of an assay utilizing die present antibodies is a double detenninant ELISA assay. 
A siq>port such as, e.g., a glass or vinyl plate, is coated with an antibody specific for CPP by 
convCTtional techniques. The support is contacted with the sanq>le suspected of containing CPP, 
usually in aqueous medium. Afta: an incubation period from 30 seconds to 12 hours, die support is 
separated from the medium, washed to remove unboimd CPP with, for example, water or an aqueous 
buffered mediimi, and contacted with an antibody specific for CPP, again usually in aqueous medium. 
The antibody is labeled with an enzyme directiy or indirectly such as, e.g., horseradish peroxidase or 
alkaline phosphatase. After incubation, die support is separated from die medium, and washed as 
above. The enzyme activity of the sv^port or flie aqueous medium is determined. This enzyme activity 

For an exarr^le of immunoassays detecting the CPP 9 polypeptides of the invraition, see 
Kurokawa, E. et al., CIin.Chmi.Acta (1983), 128(l):83-93. 

The invention also includes kits, e.g., diagnostic assay kits, for carrying out the metiiods 
disclosed above. In one embodiment, the kit conq>rises in packaged combination (a) a monoclonal 
antibody more specifically defined above and (b) a conjugate of a specific binding partner for the 
above monoclonal antibody and a label capable of producing a detectable signal. The reagents may 
also include ancillary agents such as bxiffering agents and protein stabilizing agents, e.g., 
polysaccharides and the like. The kit may further include, where necessary, otiier members of the 
signal producing system of which system the label is a member, agents for reducing backgroimd 
interfiarence in a test, control reagents, apparatus for conducting a test, and the like, hi another 
embodiment, the diagnostic kit coiiQ>rises a conjugate of monoclonal antibody of the invention and a 
label capable of producing a detectable signal. Ancillaiy agents as mentioned above may also be 
present. 

Furth^, an anti-CPP antibody (e.g., mcmoclonal antibody) can be used to isolate CPPs by 
standard techniques, such as affinity chromatography or immunoprecipitation. For exanq>le, an anti- 
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CPP antibocfy can facilitate the pmification of natural CPPs from cells and of recombinantly produced 
CPP expressed in host cells. Moreover, an anti-CPP antibody can be used to isolate CPP to aid in 
detection of low concentrations of CPP (e.g., in plasma, cellular lysate or cell supernatant) or in order 
to evaluate the abundance and pattern of expression of the CPP. Anti-CPP antibodies can be used 
diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, e.g., to 
determine the efficacy of a g^ven treatment regimen. Detection can be facilitated by coupling (i.e., 
physicaUy linking) the antibody to a label group. 

Protein Arrays 

Detection, purification, and screening of the polypeptides of the invention may be 
accon^lished using retentate chromatography (preferably, protein arrays or ch^s), as described by 
U.S. Patmt 6225027 and U.S. Patent Application 20010014461, disclosures of which are herein 
incoxporated by reference in their entireties. Briefly, retentate chromatography describes methods in 
which polypeptide (and/ or other sample conq>onents) are retained on an adsorbent (e.g., array or 
chip) and subsequently detected. Such methods involve (1) selectively adsorbing polypeptides from a 
san^le to a substrate under a pluraUty of different adsorbent/eluant combinations ("selectivity 
conditions") and (2) detecting the retention of ads<nbed polypeptides by desorption spectrometry (e.g., 

adsorbent prior to detectian. The coupling of adsorption chromatography with detection by 
desorption spectrometry provides extraordinary sensitivity, the ability to rapidly analyze retained 
con:q}onents with a variety of different selectivity conditions, and parallel processing of con^Kments 
adsorbed to different sites (i.e., "affinity sites" or "spots") on the array under different elution 
conditions. 

These methods are useful for: combinatorial, biochemical separation and purification of the 
CPPs; study of differential gene expression; detection of differences in protein levels (e.g., for 
diagnosis); and detection of molecular recognition events, (e.g., for scaieening and drug discovery). 
Thus, this invention provides a molecular discovery and diagnostic device that is characterized by the 
inclusion of both parallel and multiplex polypeptide processing capabilities. Polypeptides of the 
invention and CPP-binding substances are preferably attached to a label group, and thus directly 
detected, enabling simultaneous transmission of two or more signals from the same "circuit" (i.e., 
addressable "chip" location) dunng a single unit operation. 

Detection of CPPs by mass spectrometry 
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In accordance with tiie present invention, any instrument, method, process, etc. can be utilized 
to detennine the identity and abundance of proteins in a sample. A prefored method of obtaining 
identity is by mass spectrometry, where protein molecules in a san^le are ionized and ihsn the 
resultant mass and charge of the protein ions are detected and determined. 

To use mass spectrometry to analyze proteins, it is preferred that the protein be converted to a 
gas-ion phase. Various methods of protein ionization are useful, including, e.g., fast ion bombardment 
(FAB), plasma desoiption, laser desorption, thermal d^orption, preferably, electro^>ray ionization 
(ESI) and matrix-assisted laser desorption/ionization (MALDI). Many different mass analyzers are 
available for peptide and protein analysis, including, but not limited to, Time-of-FhgJit (TOF), ion 
trap (TTMS), Fourier transform ion cyclotron (FTMS), quadrupole ion trap, and sector (electric and/or 
magnetic) spectrometers. See, e.g., U.S. Pat No. 5,572,025 for an ion trap MS. Mass analyzers can be 
used alone, or in combination with other mass analyzers in tandem mass spectrometers. In the latter 
case, a first mass analyzer can be use to separate the protein ions (precursor ion) fiom each oth^ and 
determine the molecular weights of the various protein constituoits in the sanq>le. A second mass 
analyzer can be used to analyze each separated constituents, e.g., by fragmenting the precursor ions 
into product ions by using, e.g. an inert gas. Any desired combination of mass analyzers can be used, 
including, e.g., triple quadrupoles, tandem time-of-flights, ion traps, and/or combinations tfaereo£ 

detectors can be utilized, such as ion electron multipliers or cryogenic detectors (e.g., U.S. Pat No. 
5,640,010). Additionally, non-destructive detectors can be used, such as ion traps which are used as 
ion current pick-up devices in quadnq>ole ion trap mass analyzers or FTMS. 

For MALDI-TOF, a number of sample preparation methods can be utilized including, dried 
droplet (Karasand Hillenkamp, Anal. Chem., 60:2299-2301, 1988), vacuum-<irying (Winberger et al., 
fix Proceedings of the 41st ASMS Conference on Mass Spectrometry and Alhed Topics, San 
Francisco, May 31-June 4, 1993, pp. 775a-b), crush crystals (Xiang et al., Rapid Comm. Mass 
Spectrom., 8:199-204,1994), slow crystal growing (Xiang et al.. Org. Mass Spectrom, 28:1424-1429, 
1993); active film (Mock et aL, Rapid Comm. Mass Spectrom.,6:233-238, 1992; Bai et al.. Anal. 
Chem, 66:3423-3430, 1994), pneumatic spray (Kochling et aL, Proceedings of the 43rd ASMS 
Conferraice on Mass Spectrometry and AUied Topics; Atlanta, GA, May 21-26, 1995, pl225); 
electrospray (Hensel et al, Proceedings of the 43rd ASMS Conference on Mass Spectrometry and 
Allied Topics; Atlanta, GA, May 21 -26, 1995, p947); fest solvent evaporation (Vorm et aL, 
Anal. ChOTi., 66:3281-3287, 1994); sandwich (Li et al., J. Am Chem Soc, 1 1 8:1 1662-1 1663,1996); 
and two-layer methods (Dal et al., Anal. Chem, 71:1087-1091, 1999). See also, e.g., Liang etal.. 
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Rapid Commun. Mass Spectrom., 10: 1219-1226, 1996; van Adrichemet al.. Anal. Chem., 70:923- 
930, 1998. 

For MALDI analysis, samples are prepared as solid-state co-crystals or thin films by mixing 
them wifli an energy absorbing coxnpoimd or colloid (the matrix) in the liquid phase, and ultimately 
drying the solution to the solid state upon the sur&ce of an inert probe. In some cases an energy 
absorbing molecule (EAM) is an integral component of the san:q>le presenting surfece. Regardless of 
EAM application strategy, the probe contaits are allowed to dry to the solid state prior to introduction 
into the las^ desoiption/ionization tim&-of-fiight mass spectrometer (LDIMS). 

Ion detection in TOF mass spectrometry is typically achieved with the use of electro-emissive 
detectors such as electron multipliers (EMP) or microclannel plates (MCP). Both of these devices 
function by converting primary incident charged particles into a cascade of secondary, tertiary, 
quaternary, etc. electrons. The probability of secondary electrons being generated by the impact of a 
single incident charged particle can be taken to be die ion-to-electron conva^on efficisicy of this 
charged particle (or more sinq>ly, die conversion efSciency). The total electron yield for cascading 
events when compared to the total number of incidrat charged particles is typically described as die 
detector gain. Because generally the overall response time of MCPs is far superior to that of EMPs, 
MCPs are the prefmed electro-emissive detector for enhancing mass/charge resolving power. 

response time and broad frequency bandwidth are not necessary. 

In a prefCTred aspect, for the analysis of digested proteins, a liquid-chromatography tandem 
mass spectrometer (LC-TMS) is used. This system provides an additional stage of san^le separation 
via use of a liquid chromatograph followed by tandem mass spectrometry. 

In preferred aspects, a protein eluted from a column according to the system described in 
£xan^)le 1 is analyzed using both MS and MS-MS analysis. For exanq)le, a small portion of intact 
proteins eluting from RP2 umy be diverted to online detecti(m using LC-ESI MS . The proteins are 
aliquoted on a number of plates allowing digestion or not with trypsin, preparation for MALDI-MS as 
well as for ESI-MS, as well as preparation of the MALDI plates with different matrices. The methods 
thus allow, in addition to information on intact mass, to conduct an analysis by both peptide mass 
fingerprinting and MS-MS techniques. 

Tlie methods described herein of separating and fractionating proteins provide individual 
proteins or fractions containing small numbers of distinct proteins. These proteins can be identifred 
by mass spectral determination of the molecular masses of the protein and peptides resulting from the 
firagmaitation thereofl Making use of available information in protein sequence databases, a 
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comparison can be made between proteolytic p^itide mass patterns g^erated in silica, and 
experimentaUy observed peptide ma^es. A "hit-list" can be compiled, ranking candidate proteins in 
the database, based on (among o&^ criteria) die number of matches between die theoretical and 
experimental proteolytic fragments. Several Web sit^ are accessible that provide software for protein 
3 identification on-line^ based on peptide mapping and sequence database search strategies (e.g., 

htlp://www.e3q>a5y.ch). Me&ods of peptide mapping and sequencing using MS axe described in WO 
95/252819, U.S. Pat No. 5,538,897, U.S. Pat No. 5,869,240, U.S. Pat No. 5,572,259, and U.S. Pat 
No. 5,696,376. See, also, Yates, J. Mass Spec., 33:1 (1998). 

Data collected from a mass spectrometer typically comprises die intensity and mass to charge 

10 ratio for each detected event. Spectral data can be recorded in any suitable form, including, e.g., in 
graphical, numerical, or electronic formats, either in digital or analog form. Spectra are preferably 
recorded in a storage medium, including, e.g., magnetic, such as floj^y di^ t^, or hard disk; 
optical, such as CD-ROM or laser-disc; or, ROM-CHIPS. 

The nms spectrum of a given san^le typically provides information on protein intraisity, 

15 mass to charge ratio, and molecular weight hi preferred embodim^its of the invration, the molecular 
weights of proteins in the ^mple are used as a matching criterion to query a database. Hie molecular 
weights are calculated conventionally, e.g., by subtracting the mass of the ionizing proton for singly- 

cbarges for mult^ly-charged ions and subtracting the number of ionizing protons. 

20 Various databases are usefol in accordance with the present invention. Useful databases 

include, databases containing genomic sequences, expressed grae sequences, and/or e^qiressed protein 
sequences. Preferred databases contain nucleotide sequence-derived molecular ma^es of proteins 
present in a known organism, organ, tissue, or cell-type. There are a number of algorithms to identify 
open reading frames (ORF) and convert nucleotide sequences into protein sequence and molecular 

25 weight information. Several publicly accessible databases are available, including, the 
SwissPROT/TrEMBL database (ht^://www.e3q3asy.ch). 

Typically, a mass spectrometer is equipped with commercial software that idmtities peaks 
above a certain threshold level, calculates mass, charge, and intensity of detected ions. Correlating 
molecidar weigiht witii a given output peak can be accomplished directly from die spectral data, Le., 

30 where the charge on an ion is one and the molecular wei^t is th^^ore equal to the cfumi^rator value 
minus the mass of the ionisdng proton. However, protdn ions can be compiexed with various coimter- 
ions and adducts, such as N, C, and K*. hi such a case, it would be expected that a givea protein ion 
would exhibit multiple peaks, such as a triplet, r^esenting different ionic states (or species) of the 
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same protein. Thus» it may be necessary to analyze and process spectral data to determine funilies of 
peaks arising from the same protein. This analysis can be canied out convCTtionally, e.g., as described 
by Mann et al., anal. Chem., 61:1702-1708 (1989). 

hi matching a molecular mass calculated from a noass spectrometer to a molecular mass 
predicted Scam a database, such as a genomic or expressed gene database, post-translation processing 
may have to be considered. There are various processing events ^^ch modify protein structure, 
including, proteolytic processing, removal of N-tmninal methionine, acetylation, m^hylation, 
glycosylation, phosphorylation, etc. 

A database can be queried for a range of proteins matching Ihe molecular mass of the 
unknown. The range window can be det^mined by the accuracy of the instnun^t, the method by 
which the sample was prepared, etc. Based on die number of hits (wh^ a hit is n^tch) in the 
spectrum, the unknown protein or peptide is identified or classified. 

Methods of identifying one or more CPP by mass ^ectrometry are useful for diag;no5is and 
prognosis of cardiovascular disorders. Preferably, such methods are used to detect one or more CPP 
present in human plasma. Exemplary techniques are described in U.S. Patent Applications 
02/0060290, 02/0137106, 02/0138208, 02/0142343, 02/0155509, disclosures of which are 
incorporated by reference in their entireties. 

Diagnostic and Prognostic Uses 

The nucleic acid molecules, proteins, protein homologues, and antibodies described herein 
can be used in one or more of the following methods: diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenetics; and in drug screening as further described herein. 

The invention provides diagnostic and prognostic assays for detecting CPP nucleic acids and 
proteins, as fiirther described. Also provided are diagnostic and prognostic assays for detecting 
interactions between CPPs and CPP target molecules, particularly natural agonists and antagonists. 

The present invmtion provides methods for identifying polypeptides that are difTerentially 
expressed between two or more samples. "Differential expression" refers to differences in the quantity 
or quality of a polypeptide between samples. Such diff^^rraces could result at any stage of protein 
expression fi-om transcription through post-translational modification. For example, using protein 
array methods, two samples are bound to afSnity spots on different sets of adsorbents (e.g., chips) and 
recognition maps are campaied to idratify polypeptides diat are differentiaUy retained by the two set$ 
of adsorb»ts. Differential retention includes quantitative retention as well as qualitative differences 
in the polypeptide. For example, differences in post-translational modification of a protein can result 
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in di£^etices in recognition maps detectable as differences in binding characteristics (e«g.^ 
glycosylated proteins bind differently to lectin adsorbents) or diffidences in mass (e.g., post- 
translational cleavage products). In certain embodiments, an adsorbent can have an array of afBnity 
spots selected for a combination of markers diagnostic for a disease or syndrome. 

Differences in polypeptide levels betwea samples (e.g., differentially expressed CPPs in 
plasma sa]iq)les) can be identified by exposing the san^les to a variety of conditions for analysis by 
desorption spectrometry (e.g., mass spectrometry). Unknown proteins can be identified by detecting 
physicochemical characteristics (e.g., molecular mass), and this information can be used to search 
databases for proteins having similar profiles. 

Preferred methods of detecting a CP? utilize mass spectrometry techniques. Such methods 
provide information about the size and character of fee particular CPP isoform that is present in a 
sample, e.g., a biological saniple submitted for diagnosis or prognosis. Mass spectrometry techniques 
are detailed in tiie section titled 'T)etection of CPPs by mass spectrometry". Example 1 outhnes a 
preferred detection scheme, wherein a biological sanq>le is separated by chromatography before 
chaiact^ization by nmss spectrometry. The invention provides a method of detecting a CPP in a 
biological saixQ>le conqnising fee steps of: fractionating a biological sanq>le (e.g., plasma, serum, 
lympb, cerebrospinal fluid, cell l3^sate of a particular tissue) by at least one chromatographic step; 

observed in mass speclrometzy wife known charactmstics of CPP polypeptides (e.g., CPP 2, CPP 9, 
CPP 17, CPP 20 and CPP 21, as disclosed in Table 1). 

The isolated nucleic acid molecules of the invention can be used, for example, to detect CPP 
mRNA (e.g., in a biological sample) or a genetic alteration in a CPP-encoding gene, and to modulate a 
CPP activity, as described furfeer below. In addition, fee CPPs can be used to screen for naturally 
occ-ucring CPP target molecules, and to screm for drugs or con^unds which modulate CPP activity. 
Moreover, fee anti- CPP antibodies of the invention can be used to detect and isolate CPPs, regulate 
fee bioavailability of CPPs, and modulate CPP activity. 

Accordingly one embodiment of fee present invention involves a mefeod of use wherein a 
molecule of fee present invention (e.g., a CPP, CPP nucleic acid, or CPP modulator) is used, for 
example, to diagnose, and/or prognose a disorder in which any of fee aforementioned CPP activities is 
indicated, in anofeer embodiment, fee present invention involves a mefeod of use vtiierein a molecule 
of fee present invention is used, for exanqile, far fee diagnosis, and/or prognosis of subjects, 
preferably a hunmn subject, in which any of the aforementioned activities is pathologically perturbed 

For example, fee invention enconqsasses a mefeod of determining whefeer a CPP is expressed 
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within a biological san^le conqmsing: a) contacting said biological sample with: i) a polynucleotide 
that hybridizes undo: stringent conditions to a CPP nucleic acid; or ii) a detectable polypeptide (e.g. 
antibody) that selectively binds to a CPP; and b) detecting the presence or absence of hybridi^tion 
between said polynucleotide and an KNA species within said sample, or the presence or abs^ce of 
binding of said detectable polypeptide to a polypeptide within said san^le. Detection of said 
hybridization or of ^d binding indicates that said CPP is expressed within said sample. Pteferably, 
the polynucleotide is a primer, and said hybridization is detected by detecting the piesence of an 
an^lification product comprising said primer sequence, or the detectable polypeptide is an antibody. 

In obtain embodiments, detection involves the use of a probe/primer in a polymerase chain 
reaction (PCR) (see, e.g., U.S. Pat. Nos. 4,683,195 and 4,683,202, fiie disclosures of v^ch are 
mcorporated herein by reference in ^ir entireties), such as anchor PGR or RACE PCR, or, 
alternatively, in a ligation chain reaction (LCR) (see, e.g., Landegren et al. (1988) Science 241: 1077- 
1080; and Nakazawa et al. (1994) PNAS 91:360-364, die disclosures of which are incorporated herein 
by ref^ence in their entireties), the latter of which can be particularly useful for detecting point 
mutations in the CPP-encoding-gene (see Abravaya et al, (1995) Nucleic Acids Res, 23:675-682, the 
disclosure of which is incorporated herein by reference in its entirety). 

Also envisioned is a method of determining wheth^ a mammal, prefiraably human, has an 

said mammal; and b) comparing the amount of a CPP or of a CPP RNA species encoding a CPP 
within said biological sample with a level detected in or expected from a control sanqile. An increased 
amount of said CPP or said CPP RNA species within said biological sanq)le conq)ared to said level 
detected in or expected from said control sangjle indicates that said mammal has an elevated level of 
CPP expression, and a decreased amount of said CPP or said CPP RNA species witiiin said biological 
^mple compared to said level detected in or expected from said control san:q)le indicates that said 
mammal has a reduced level of expression of a CPP. 

The present invention also pertains to the field of predictive medicine in which diagnostic 
assays, prognostic assays, and monitoring clinical trials are used for prognostic purposes. 
Accordingly, one aspect of the present invention relates to diagnostic assays for determining CPP 
and/or nucleic acid esqiression as well as CPP activity, in the context of a biological san^le (e.g., 
blood, plasimi, cells, tissue) to thereby determine ^e&er an individual is afflicted with a disease or 
disorder, or is at risk of developing a disorder, associated with aberrant CPP expression or activity. 
The invention also provides for prognostic (or predictive) assays for determining whdh^ an 
individual is at risk of developing a disorder associated with a CPP, nucleic acid expression or 
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activity. For example, mutations in a CPP-encoding gene can be assayed in a biological sample. Such 
assays can be used for prognostic or predictive purpose to thereby prophylactically treat an individual 
prior to the onset of a disorder characterized by or associated witii CPP expr^sion or activity. 

The term *l>iologica] sample" is int^ded to include tissues, cells and biological fluids isolated 
from an individual, as well as tissues, cells and fluids preset within an individual. That is, the 
detection methods of the invention can be used to detect a CPP noRl^ protein, or gnomic DNA in a 
biological ^nQ)le in vitro as well as in vivo. Preferred biological samples are biological fluids such as 
lymph, cerebrospinal fluid, blood, and especially blood plasma. For example, in vitro techniques for 
detection of a CPP mRNA include Northern hybridizations and in situ hybridizations, hi vitro 
techniques for detection of a CPP include mass spectZDmetry, Enzyme Linked hnmuno Sorbent 
A^3^ (BLISAs), Western blots, immunoprecipitati<ms and innnunofluorescence. hi vitro techniques 
for detection of a CPP-encoding gmomic DNA include Southmi hybridizations. Furthermore, in vivo 
techniques for detection of a CPP include introducing into an individual a labeled anti- CPP antibody. 

For an example of immunoassays detecting the CPP 9 polypeptides of the invention, see 
Kurokawa, E. et al., ain.ChmiActa (1983), 128(l):83-93. 

hi preferred embodiments, the subject methods can be characterized by generally comprising 
detecting, in a tissue sanq>le of the individual (e.g. a human patient), the presmce or absence of a 

<JP¥ or <ii> tiie mis-esqiression of a CPP-encoding grae. To illustrate, such genetic lesions can be 
detected by ascertaining tiie existence of at least one of (i) a deletion of one or more nucleotides from 
the CPP-encoding gene, (ii) an addition of one or more nucleotides to the gene, (iii) a substitution of 
one or more nucleotides of the gene, (iv) a gross chromosomal rearrangement or amplification of the 
g«ie, (v) a gross alteration in the level of a messengCT RNA transcript of the gene, (vi) aberrant 
modification of the gene, such as of the methylation pattern of the genomic DNA, (vii) the presence of 
a non-wild type splicing pattern of a messenger RNA transcript of the grae, and (viii) reduced level of 
expression, indicating lesion in regulatory element or reduced stability of a CPP-encoding transcript. 

In yet another exemplary embodiment, abOTant methylation patterns of a CPP nucleic acid 
can be detected by digesting genomic DNA from a patient sample with one or more restriction 
endonucleases that are sensitive to methylation and for which recognition sites exist in the CPP- 
encoding gme (including in the flankmg and intronic sequences). See, for example, Buiting et al. 
(1994) Human Mol Genet 3:893-895. Digested DNA is separated by ge! electrophoresis, and 
hybridized witii probes derived from, for ^cairqile, genomic or cDNA sequraces. The methylation 
status of the CPP-encoding gene can be determined by comparison of fee restriction pattern generated 
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from the ^ti:^>le DNA with tiiat for a standard of known meihylation. 

In yet another embodiment, a diagnostic assay is provided which detects the ability of a CPP 
to bind to a cell surface or exfraceUular protein. For instance, it will be desirable to detect CPP 
mutants winch, while expressed at appreciable levels in the cell, are defective at bixiding a CPP target 
protein (having either diminished or enlaced binding affinity for the target). Such mutants xmy arise, 
for example, from mutations, e.g., point mutants, which may be inqn^ctical to detect by the diagnostic 
DNA sequencing techniques or by the immunoassays described above. The present invention 
accordingly further contenq^lates diagnostic screening assays which generally comprise cloning one 
or more CPP-encoding gene from the sample tissue, and expressing tiie cloned genes under conditions 
which pemnt detection of an interaction between that recombinant gene product and a target protein. 
As will be a^^aient from the description of die various drug screening assays set fordi herein, a wide 
variety of techniques can be used to determine die ability of a CPP to bind to other components. 
These techniques can be used to detect mutations in a CPP-mcoding gene which give rise to mutant 
proteins widi a higher or lower binding affinity for a CPP target protein relative to the wild-type CPP. 
Conversely, by switching which of the CPP target protein and CPP is the 'Tjait" and which is derived 
from the patient sample, the subject assay can also be used to detect CPP target protein mxitants which 
have a higher or Iowct binding affinify for a CPP relative to a wild type form of that CPP target 

In an exenqdaiy embodiment, a target protein can be provided as an immobilized protein (a 
'^target"), such as by use of GST fusion proteins and glutathione treated microtiter plates as described 

hsreia. 

In another embodiment, the methods flirther involve obtaining a control biological sample 
from a control subject, contactic^ the control sample with a con^imd or agmt capable of detecting a 
CPP, mKNA, or goiomic DNA, such that the level of a CPP, mRNA or genomic DNA is measured in 
the biological san:q)le, and conq)aring the level of a CPP, mRNA or genomic DNA in the control 
sample with the level of a CPP, mRNA or goiomic DNA m the test sanq>le. The inv^tion also 
encompasses kits for detecting the presence of a CPP, mRNA or genomic DNA in a biological 
sairqsle. For exan^le, the kit can contuse: a labeled compound or agent capable of detecting a CTP, 
mRKA or genomic DNA in a biological sample; means for determining the amount of a CPP in die 
sample; and m^ns for con^aring the amount of CPP in the sample with a standard. The compound or 
agent can be packaged in a suitable container. The kit can further contqnise instructions for using the 
kit to detect CPP or nucleic acid. 
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CPPs clusters 

In one aspect of the invention, me&ods for the diagnosis of cardiovascular disorders conoprise 
detecting in a test biological saD^}le tiie presence or level of one or more CPP of the invention in 
combination with ^e detection of other Cardiovascular disorder Plasma Polypeptides (CPPs). 
Particularly prefened o&er CPPs for use in the diagnosis of cardiovascular disorders in combination 
with CPPs of the invention are listed in Table 2. 
Table 2 



Table 2 


CPP# J CEX 


RP1 


Tryptic Sequsnces (RP2) 


j| CPP 8 j 18 


6 


CLDPVDTPNPTR (7-*), YKKPECQSDWQCPGK (8) 


«»P 8 ^ 18 


7 


' CU:W=VDTPNPTR (7-8) 


CPP 8 


18 


11 


CLOPVDTPN'PTR (5) 

f 


CPP8 


18 


13 


(XDPVOTPNPTR (3) 


CPP 12 


1C 


8 


QSGEDNQDLAJSFAGNGLSALR (8-9) 


CPP 12 


11 


\ 8 


ESLSGVCEIS^ (9). QSGEDNQDLAISFAGNGLSALR (9) ' 


CPP 12 


11 




QSGEDNQDUISFAGhTGLSAlR (9) 


CPP 12 


11 


10 


QSGEDNQDLAISFAGNGLSALR (7) 


CPP 12 


11 


11 


ESLSGVCEISGR (7) 


CPP 13 


13 


14 


VSAQQVQGVHAR (9. 12) 


CPP 13 
X;PP13^ 


13 

43 


18 

|- 20 - 


FPVYDYDPSSLR (5). VNSQSLSPYLFR (5-6) 

tJrr VSTOVCR (4-5). FPVYDVOPSSLR (5). VMSQSLSPVLFR (4-5), VSAQQVQGV/HAR (5) 


CPP 13 


13 21 

1 


DYWSTAVCR (5), VNSQSLSPYLFR (5) 


CPP 13 


13 


i 22 


DYYVSTAVCR (10), FPVYDYDPSSLR (3). VNSQSLSPYLFR (3-4) 


CPP 13 


13 


23 


VNSQSLSPYuFH (3) 


CPP 13 


13 


25 


DYYVSTAVCR (1). VNSQSLSPYLFR (1) 


CPP 13 




13 


DALSASWK (15), DSGEDPATCAFQR (15), FPVYDYDPSSLR (15). VNSQSLSPYLFR 

(14). VSAQQVCK3VKAR (15) 


CPP 13 


14 


15 


DSC^DPATCAFQR (10), VNSQSLSPYLFR (10) 


CPP 13 


14 


19 1 VSAQQVQGVHAR <7) 


CPP 13 


14 


21 VSAQQVQGVHAR (5. 7. 8) 


CPP 13 


14 


22 i 


VNSQSLSPYLFR (3) 


CPP 13 


14 


25 


VSAQOV(3GVHAR (2) 


CPP 13 


15 


13 


VNSQSLSPYLFR (17-18) 


CPP 13 


15 


15 


VNSQSLSPYLFR (11) 


CPP 13 


18 1 22 


VSA(3aVQGVHAR (3) 


CPP 14 


15 3 


4 


CSVSLRPiGASCR (10) ' 


CPP 14 


16 


6 


(3VSLRPI(3ASCR (9) ' 


CPP 14 


17 


5 


GVSLRP](3ASCRDDSECrrR (9-10) 


CPP 14 


18 


7 


GVSLI^.GASCR (9) 


CPP 15 


2 


7 


LQCYNCPNPTADCK (24) 


CPP 15 j 


2 


« 1 


AGLQVYJsfK (15), LQCYNCPMPTAOCK (15. 17. 18), LRENEITYYCCK (16-18) 
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Table 2 


CPP# 


CEX 


RP1 


Tryptfc Sequences (RP2) 


CPP 15 


2 


9 


ENELTYYCCK (17), FEHCNFNDVTTR (17), LQCYNCPNPTADCK (17), LRENELTYYCCK 
(16--! 7) 


CPP 15 


2 


10 


LQCYNCPNPTADCK (12), LRENELTYYCCK (12) 


CPP 15 


3 


9 


FEHCNFNOVTTR (15-16), LQCYNCPNPTADCK (15-16) 


CPP 15 


3 


10 


AGLQNAm (9, 11), i=EHCNFNDVrrm (10-11). LCXJYNCPNPTADCK (8-11, 16). | 

LREMELTYYCCK (9-11.13) j 


CPP 15 


3 


11 


FeHCNFNDVTTR (9-10). L(5CYNCPNPTAEK:K (9-11), LRENELTYYCCK (9-1 1) 


CPP 15 


3 


12 


LQCYNCP>SPTADCK (7) 


CPP 15 j 3 


13 


LQCY^SCPKPTADCK (7-8) 


CPP 15 


4 


9 


FEHCNFNDVTTR (14-15). LQCYNCPJ^ADCK (14-16), LRENELTYYCCK (14-15) 


CPP 15 


4 


10 


FEHCNFNDVTTR (9-11), LQCYNCPNPTADCK (9-10), LRENELTYYCCK (10) 


CPP 15 


5 


11 


AGLQVYNK (10), FEHCNFNDVTTR (10) 


CPP 15 


6 


9 


LC5CYNCPNPTADa< (14) 


CPP 15 


6 


10 


FEHCNFNDVFTR (8) 


CPP 15 


6 


11 


TAVNCSSDFDACUTK (10) 


CPP 15 


7 


10 


FEHCNFNDVTTR (9) 


CPP 16 


10 


15 


CLTTDEYDGHSTYPSHQYQ (12), TVAGQDAVIVLLGTR (10), 
YVAVMPPHIGDQPLTGAYTVTLDGR (11) 


CPP 16 


10 


16 


CLTTDEYDGHSTVPSHQYQ (9), LQAVTDDHIR (9), YVAVMPPHKaDQPLTGAYTVTLDGR 

(9) 


CPP 16 


10 


19 

16"" 


NDLSPTTVlytSEGAR (7) 
[ TVACK3DAV1 VlLGTR (9) \ 


CPP 16 


13 


17 


NDLSPTTVMSEGAR(IO) \ 


CPP 18 


10 


11 


QCIHQLCFTSLR (15-19) =1 


CPP 18 




6 


LPPCENVDLQRPNGL (13) ] 


CPP 18 


11 


7 


SNYFRLPPCeNVDLQRPNGL(13) i 


CPP 18 


11 


10 


QCiHQLCFTSLR (14-15) 


i CPP 16 


11 


11 


QCIHCH-CFTSLR (12-17, 19. 20) 


CPP 18 


12 


7 


LPPCENVDLQRPNGL (12), SNYFRLPPCENVDLQRPNGL (9-1 1) 


CPP 18 


12 


10 


QCIHQLCFTSLR (10. 14) 


CPP 18 


12 


11 


LYSVHRPVK (1 1 ). QCIHQLCFTSLR (1 1 ) 


CPP 18 


13 f 7 

! 1 


LFWENVDLQRPNGL (15), SNYFRLPPCENVDLQRPNGL (15) 


CPP 19 


3 ) 7 


MSSSYPTGLADVK (10) 


CPP 19 


4 


7 


AGPAQTLlRPQDfelK (10). MSSSYPTGLADVK (10), 
MSSSYPTGLADVKAGPAQTURP(3D7b3K (10) 


CPP 40 


17 


20 


EDPTVSALLTSEK (9). VPSLVGSFIR (8-9) 


CPP 40 


17 


22 


VPSLV6SFIR (7) 


CPP 40 


1 ^« 


20 


VPSLVGSFIR (9) 


CPP 41 12 


12 


LQNNE>J?^SCVER (9), STDTSCVNPPTVQP^AHILSR (9) 


CPP 41 


14 


12 


TGESAEFVCK (9) 


CPP 41 


15 


1 '•I 


rrCTEEGWSPTPK (15-16), STDTSCVNPPTVQNAHILSR (15-16), TGESAEFVCKR (16) 


CPP 41 


16 j 29 


YKPFSQVPTGcVFYYSCEYNFVSPSK (1) 
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Table 2 


1 CPP# 


CEX 


RP1 


Tiyptic Sequences (RP2) 


CPP 41 


17 


12 


CLHPCVISR (9). EiMENYNIALR (9), INHGJLYDEEK (9). ITCTEEGWSPTPK (9), 
LQNKENNl«:VER (9). SFWTRITCTEEGWSPTPK (9). STDTSCVNPPTVQNAHILSR (9). 
TGES/^FVCKR (9). TTCWDGKLEYPTCAK (9) 


CPP41 


17 


13 


CLHPCViSR (9), EATFCDFPK (9), EIMENY541ALR (9). GWSTPPK (9). IKHGILYDEEK (9). 
ITCTEEGWSPTPK (9), LQNMENNISCVER (9). STDTSCVIsiPPTVQNAHILSR (9), 
TTCWDGKLEYPTCAK (9) 


CPP 41 


17 


14 


aWENYNJALR (7), INHGILYDEEK (7), ITCTEEGWSPTPK (7). LCM^fNENNISCVER (7), 
STDTSCVNPPTVQNAHILSR C7). YKPFSQVPTGEVFYYSCEYNFVSPSK (7) 


CPP 41 


17 


\ 16 
1 


EATFCDFPK (5), EIMENYNIALR (5), LQNNENMSCVER (5). STDTSCVNPPTVQN/^ILSR 
{5). TGESAEFVCK (5). TTCWDGKLEYPTCAK (5) 


CPP 41 


17 


\ 

i 

18 


CLHPCVISR (4). EATFCDFPK (4), EIMENYNIALR (4>. INHGILYDEEK (4), 
rrCTEEGWSPTPK (4). LrYPTCAK (4). SFWTK (4). STDTSCVNPPTVQ^fAHlLSR (4), 
TTCWDGKLEYPTCAK (4) 


CPP 41 


17 


20 


EATFCDFPK (3). EIMENYNIALR (3), INHGILYDEEK (3). ITCTEEGWSPTPK <3>. 
LQNNENNISCVER (3). STDTSCVNPPTVQNAHILSR (3), TGESAEFVCKR (3). 
TTCWDGKLEYPTCAK (3). YKPFSQVPTGEVFYYSCEYNFVSPSK (3) 


CPP 41 


17 


22 


EATFCDFPK (1), EIMENYNIALR (1), INHGILYDEEK (1), ITCTEEGWSPTPK (1). j 
NGQWSEPPKCLHPCVISR (1), SFWTRITCTEEGWSPTPK (1). 
STDTSCVNPPTVQNAHILSR (1). TTCWDGKLEYPTCAK (1) 


CPP 41 


17 


26 


CLHPCVISR (1), EATFCDFPK (1). INHGILYDEEK (1). ITCTEEGWSPTPK (1), 
NGQWSEPPK (1). STDTSCVNPPTVQNAHILSR (1), TGESAEFVCK (1). 

ITCTEEGWSPTPK (1). STDTSCVNPPTVQNAHILSR (1). TGESAEFVa<R (1) 


CPP 41 


17 


■ 

27 


CLHPCVISR (1). EATFCDFPK (1), EIMENYNIALR (1), INHGILYDEEK (1). 
ITCTEEGWSPTPK (1), LEYPTCAK (1). LQNNENNISCVER (1), SFWTR (1). 
STDTSCVNPPTVQNAHILSR (1) 


CPP 41 


17 


29 


CLHPCVISR (1). EATFCDFPK (1), EIMENYNIALR <1), INHGILYDEEK (1), 
ITCTEEGWSPTPK (1), LQNNENNISCVER (1), STDTSCVNPPTVQNAHILSR (1). 
TGESAEFVCK (1). TTCWDGKLEYPTCAK (1). YW=»FSQVPT<^VFYYS(^NFVSPSK (1) 


CPP 41 


17 


30 


CLHPCVISR (1). EIMENYNIALR (1). INHGILYDEEK (1). ITCTEEGWSPTPK (1). 
LQNNENNISCV^ (1), STDTSCVNPPTVQNAHILSR (1), TTCWDGKLEYPTCAK (1) 


CPP 41 

i 


18 


12 


CLHPCVISR (9-10). EATFCDFPK (9-10), EIMENYNIALR (9), INHGILYDEEK (9-10), 
ITCTEEGWSPTPK (9). LaNNENNlSCVER (9-10). STDTSCVNPPTVQNAHILSR (9-10). 
TGESAEFVCKR (9), TTCWDGKLEYPTCAK (9-10) 


CPP 41 


16 


18 


EATFCDFPK (4). EIMENYNIALR (4). LQNNENNISCVER (4). STDTSCVNPPTVQNAHILSR 
(4). TTCWDGKLEYPTCAK (4) 


CPP 41 


18 


19 


EATFCDFPK (4). EIMENYNIALR (4), INHGILYDEEK (4). ITCTEEGWSPTPK (4). 
LQN>{ENNISCVER (4), NGQWSEPPK (4). STDTSCVNPPTVQNAHILSR (4) 


CPP 41 


18 


20 1 


LQN>fENNISCVER (3). STDTSCVNPPTVQNAHILSR (3), TGESAEFVCKR (3), 
TTCWDGKLEYPTCAK (3) 


CPP 41 


18 


22 ( 


EATFCDFPK (1). INHG!LYDEB< (1), ITCTEEGWSPTPK (1). LQNNENNISCVER (1). 
STDTSCVNPPTVQNAHILSR (1 ). TGESAEFVCKR (1 ). TTCWOt«<LEYPTCAK (1 ) j 
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Table 2 ! 


CPP# 


CEX 


RP1 


1 Tryptte Sequences (JRP2) 


CPP 41 


18 


26 



CLHPCVISR (1), EATFCDFPK (1), EIMENYN3ALR (1). INHGILYDEEK (1). 
ITCTEEGWSPTPK (1). STDTSCVKPPTVaNAHILSR <1), TGESAEFVCK <1> 


I CPP 41 


18 


29 


EATFCDFPK (1), aWENYNyVLR (1). INHGILYDEEK (1). ITCTEEGWSPTPK (1), 
LQNNENJsilSCVER (1). STDTSCVNPPTVQNAHILSR (1), TTCWDGKLEYPTCAK (1) 


CPP 149 




9 


AFTECCWASQLR (11). CCYDGACVNNDETCEQR (10) 


CPP 149 




9 


AFTECCWASQLR (11), CCYDGACVNNDETCEQR (11) 


CPP 149 


15 


9 


AFTECCWASQLR (8). CCYDGACVNNDETCEQR (8) 


CPP 149 


t5 


11 


AFTECCWASQLR (7) 1 


CPP 149 


16 ' 8 


AFTECCWASCa-R (8-9) 


CPP 149 


16 


9 


AFTECCWASQLR (9) 


CPP 149 


17 


10 


AFTECCWASCiLR (7). CCYDGACVNNDETCEQR (7) 


CPP 150 


14 


22 


TNFDNDIALVR (12) 


CPP 150 


t 

14 


23 


SNALDIJFQTDLTGQK (11), SSNNPHSPJVEEFQVPYNK (1 1). TNFDNDIALVR (10-11), 
[ VEDPESTLFGSVIR (8) 


CPP 150 


14 


24 


DWQJTCLDGFEWEGR (12). EDTPNSVWEPAK (9), QFGPYCGHGFPGPLNIETK (8-1 2X 
SNALDIIFQTDLTGQK (8-9. 12). SSNNPHSPIVEEFQVPYNK (9), TNFDNDIALVR (8-12) 


CPP 150 


14 


25 


QFGPYCGHGFPGPLNIETK (9), a^ALDIIFOTDLTGQK (5), TNFDNDIALVR (5. 9). 
VEDPESTLFGSVIR (9) 


CPP 150 


14 


26 


GDSGGAFAVCfflPNDK (8). SNALDIIFQTDLTGQK (7-8). TNFDNDIALVR (7) 


CPP 150 


14 


27 


GDSGGAFAVQDPNDK (8). QFGPYCGHGFPGPLNIETK (5), SNALDIIFQTDLTGQK (6, 8). ) 
TNFDNDIALVR (5. 7. 8) j 


CPP 150 


\ -1 
14 


' 29 


10.11.12,13. 14. 16. 17,19,20) 


CPP 150 


14 


30 


GDSGGAFAVQDPNDK (8). QFGPYCGHGFPGPLNIETK (11). TNFDNDIALVR (7) 


CPP 151 


10 


22 


SFEGLGQLEVLTLDHNQLQEVK (8) 


CPP 151 


12 


21 


SFEGLGQLEVLTLDHNQLQEVK (14) 


CPP 151 


13 


23 


LAELPADALGPLQR (12). LAYLQPALFS^AELR (11), LEALPNSLLAPLGR (12). 
VAGLLEDTFPGUGLR (11-12) 


CPP 501 


2 


21 


EFLEDTCVQYVQK (7), TQSGLQSYUQFHGLVR (7) 


CPP 501 


2 ! 


22 


CFLGCELPPEGSR (6), EFLEDTCVQYVQK (6), TQSGLQSYLLQFHGLVR (4, 6) 


CPP 501 


2 < 23 


EFLEDTCVQWQK (6), TQSGLQSYLLCa=HGLVR (5) 


CPP 502 


15 


2 


ALNSI)DVYHK(I) 


CPP 502 


17 


17 


ALNSIIDVYHK (9). GADVWFK (9) 


CPP 502 17 


18 


ALNSIIDVYHK (7). LLETECPQYIR (7) 


CPP 502 


18 


16 


ALNSIIDVYHK (8) 


CPP 502 


18 


17 


ALNSIIDVYHK (6-10), GADVWFK (9). GNFHAVYR (9), LLETECPQYIR (8, 10), MLTELEK 
(9) 


CPP 502 


18 


18 


ALNSIIDWHK (6-7), GADVWFK (7). LUsTfcCPQYIR (6-7) 


CPP 502 


16 


19 


ALNSIIDVYHK (7). LLETECPQYIR (7) 


CPP 502 


18 


20 


ALNSIIDVYHK (6). LLETECPQYIR (5) 


CPP 502 


18 


21 


LLETECPQYIR (6) 


CPP 502 1 IB 


22 


ALNSIIDVYHK (5). LLETECPQYIR (4) 
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Table 2 


CPP# 


CEX 


RP1 j Tiyptic Sequences (RP2) 


j CPP 502 


18 


1 23 I tlHTECPQY.R(5) ^ 


CPP502 


1 18 1 24 1 ALNSUDVYHK (3). LLETECPQYIR (3) 


C^502 


18 j 25 ! ALNSHDVYHK (3). LLETECPQYIR (3) 


CPP 502 


18 


j 26 1 U-ETECPQYIR (4) 


CPP 502 


18 j 27 


ALNSHDVYHK <4), LLETECPQYIR (4) 


CPP 503 


16 1 21 


FALLGDFFR (6) 


CPP 503 


17 


18 


CMGTVTLNCIAR (7), FAUGOFFR (6-8), GSFDISCDK (7) 


CPP 503 


17 


19 


CMGTVTLNQAR (7), FALLGDFFR (6-7), IKDFLR (7) 


CPP 503 


17 


20 


FALLGDFFR (6). FALLGDFFRK (6) 


1 CPP 503 


17 


21 


FALLGDFFR (5-6). fKDFLR (6). TTQQSPEDCDFK (6) 


1 CPP 503 


17 


22 


C»flGTVTLNQAR (4), FALLGDFFR (4. 6) 1 


' CPP 503 


17 


23 


C^IGTVTLNQAR (4), FALLGDFFR (4), QVLSYKEAVLR (4) 


CPP 503 


' 17 

t! 


24 


CMGTVTLNQAR (2-3), FALLGDFFR (3) 


CPP 503 


' 17 


1 25 


I CMGTVTLNQAR (2), FALLGDFFR (3). FALLGDFFRK (2) 


CPP 503 


17 


26 


TTQQSPEDCOFKK (2) 


CPP 503 


17 


27 


CMGTVTLNQAR (4), FALLC^FFR (4), TTQQSPe^CDFKK (4) 


CPP 503 


17 


28 


FALLGDFFR (5) 


CPP 503 


17 


29 


CMGTVTLhiQAR (4), FALLC^DFFR (4). GSFDISCDK (4). TTQQSPEDCDFK (4). 
TTQQSPEDCOFKK (4) 


CPP 503 


17 


ZiO 


FALLGDFFR (4) 


, CPP 503 


18 


12 


AIDGINQR (10) ( 


CPP5oa 




fO f <:^yK3T>mM3AR (7), FALLGOFFR (6-7), FAILGDFFRK (7) 


CPP 503 


18 


20 ' CWC3TVTLNQAR (5-6), FALLGDFFR (5-6), FALLGDFFRK (5-6) 


CPP 503 


18 ] 


21 t CMGTVTtNQAR (6), FALLGDFFR (5-7), IKDFLR (5) 


CPP 503 


18 


1 

22 


i CMGTVTLNQAR (4). FALLGDFFR (4-5), GSTOISCDK (4), GSFDISCOKDNK (4), IKDFLR 
(4) 


CPP 503 


18 


23 


FALLGDFFR (4-5) 


CPP S)3 


18 


24 


CMGTVTLNQAR (2), FALLGDFFR (2). FALLGDFFRK (2) 


CPP 503 


16 


25 


CMGTVTLNQAR (2). FAUGDFFR (2-^X FAUGDFFRK (2-3), GSFDISCDK (3) 


CPP 503 


18 


28 


CMGnrVTLNQAR (4), FALLGDFFR (3^>, QVLSYKEAVLR (4) 


CPP 503 


18 


27 


CMGTVTLNQAR (4), FALLGDFFR (3-4). TTQQSPEDCDFK (4) 


CPP 503 


16 


28 


FALLGDFFR (3) ; 


CPP 503 


18 


29 


FALLGDFRR (3-4) 


CPP 503 


18 


30 


CMGTVTLNQAR (3-4), FALLGDFFR (3^). FALLGDFFRK (3), TTQQSPEDCOFKK (4) 


CPP 504 


9 


15 


VPLQQI^FQDNQFQGK (15) 


CPP 504 


9 


16 


CDYWIR (11), ELTSELK (10). MYATOTELK (10-11), SLGLPEKHIVFPVPIDQCIDG (10), 
SYPGLTSYLVR (11). TFVPGCQPGEFTLGNIK (10.11), VPLQa\FQDNQFQGK (10-11). 
WSTNYNQHAMVFFK (10). WYWGLAGNAIIR (^0) 


CPP 504 


9 


18 


VPLQQNFQDNQFQGK (8) 


' CPP5C5 


6 


8 


WEPPEKDDQLWLFPVQKPK (8) 


CPP 505 1 6 


10 


AW^rtETEDTLGR <8) 


i CPP505 1 


7 

L 


9 


WEPPEKDDQLWLFPVQKPK (12) [ 
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Table 2 



CPP# 


CEX 


RP1 


Tryptfc Sequences (RP2) 


CPP505 


8 


10 


AWMETEDTLGR (8), WEPPEKDDQLWLFPVQKPK (8) 


CPP505 


8 


11 


AWMETEDTLGR (8-9), HWPSEQDPEKAWGAR (8). LLTTEEKPR (8), 
LWVrWPNHQVLLi^EEDQDHfYHPQ (8). \A/EPPEKDDQLWLFPVCfi<PK (8) 


[ CPP505 
) 


9 


8 


LLTTEEKPR (11-13). WEPPEKDDQLWLFPVQKPK (12) 


- 

CPP 505 


9 


9 


AWI^/ETEDT^GR (11. 13. 14, 15), GPILPGTK (13), HWreEQOPEK (14). 
HWPSECaJPEKAWGAR (12). LLTTEEKPR (10, 13. 15. 17). 
WEPPEKDDQLWLFPVQKPK (11-15) 


CPP 505 


9 


10 


AWMETEDTLGR (9-11. 13. 14), DDQLWLFPVCiKPK (9-10), 
LVWMPMHOVLLGPEEDQDHIYHPQ (10), WEPreKDDQLWLFPVQKPK (11) 


CPP 505 


1 

i 

0 


1 

11 


AWMETEDTLGR (9-11. 13), GPILPGTK (11-12X HWPSEQDPEKAWGAR (11), 
LLTTEEKPR (10-12), LLTTEEKPRGQGR (11), LWVMPNHQVLLGPEEDCaJHIYHPQ (11- 
12), VLSPEPDHDSLYHPPPEEDtMKERPR (11). WEPPEKDDQLWLFPVQKPK (10-11) 


CPP 505 


10 


9 


AWMETEDTLGR (15-18), LLTTEEKPR (14-16). WEPPEKDDQLWLFPVQKPK (17) 


CPP 505 


10 


10 


AWMETEDfTLGR (8. 10, 11), HWPSEQDPEK (10), LLTTEEKPR (10), 
LWVMPNHQVLLGPEEDQDHIYHPQ (10-11), WEPPEKDDQLWLFPVQKPK (9-11) 


CPP 505 


10 


11 


AWMETEDTLGR (8). WEPPEKDDQLWLFPVQKPK (11) 


CPP 505 


11 


9 


LLTTEEKPR (14), WEPPEKDOQLWLFPVWPK (16-17) 


CPP 505 


11 


10 


AWMETEDTLGR (11). LWVMPNHQVaGPEEDQDHlYHPQ (11). 

WEPPEKDDQLWL^WQKPK {9. 11) 
] 


CPP5(W 


8 


19 


EVMPSIQSLDALVK (5) 


: CPP 506 
1 CPP 506 


8 


20 


EVMPSIQSLDALVK (5) 


f * i ^ 1 




CPP 506 


8 


! 25 


EVK^PSIQSLDALVK (3) 


CPP 607 


5 


7 


N/WTRSPQQR (11-12) 


CPP 507 


6 


6 


NANTHSPQQR (8) 


CPP 507 


8 


8 


NANTHSPQQR (7) 


CPP 507 


10 


7 


NANTFISPQQR (7-8. 11). YESHESMESYELNPFINRR (12) 


CPP 507 


10 


8 


NANTFlSPGtC^ (6) 


CPP 507 


11 


8 


NANTFISPQQR (8^, 11,12) 


CPP 507 j 


i 11 


11 


NANTF1SP(2QR (8) 


CPP 507 


12 


8 


NANTFISPQQR (6-7, 12) 


Cf^ 507 


12 


9 


NANTFISPQQR (7-9) 


CPP 507 


13 


8 


NANTFISPQQR (8) 


CPP 507 


13 


9 


NANTFISPQQR (8-12) , 


CPP 507 


14 


6 


NANTFISPQQR (10) 


CPP 507 


14 


7 


NANTRSPQQR (10-14) 


CPP 507 


14 


9 


NANTFISPQQR (8) 


CPP 507 


14 


11 


NANTRSPQQR (7) j 


CPP 507 


15 


7 


NANTRSPC^ (16) 


CPP 507 


16 


8 


NANTFISPQQR (7) 


CPP 508 


4 


12 


GPETLC^iaAELVDALQFVCGDR (8) 


CPP 508 


5 


13 i GPETLCGAELVOALQF=VCGDR (8-9) 
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Table 2 


CPP# 


CEX 


RP1 


Tiyptfc Se<{uences (RP2} 


CPP 508 


5 


16 


GPETLCI3AEI.VDALQFVCGDR (4) 


CPP 508 


6 


11 


GPETLCGAELVDAtQFVCGOR (10) 


CPP 503 


6 


12 


GPETLCGAELVDALQFVCGDR (8) 


CPP 508 


6 


13 


GPETLCGAELVDALQFVCGDR (8. 12) 


CPP 508 


7 


10 


GPETLCGAELVDALQFVCGDR (1 1 ) 


CPP 508 


7 


11 


GFYFNKPTGYGSSSR (11), GPETLCGAS.VDALQFVCGOR (11-13), RAPQTGIVDECCFR 

(13-14) 


CPP5C8 


7 


12 


1 GFYFNKPTGYGSS^ (9), GPETLCGAELVDALQFVCGDR (8-9) 


CPP5C» 


8 


12 


APQTGIVDECCFR (8), GPETLCGAELVDALQFVCGDR (7-8) 


CPP 508 


8 


13 


GFYFNKPTGYGSSSR (6-7), GPETLCGAELVDALQFVCGDR (6-8), RAPQTGIVDECCFR 
(7) 


CPP 508 


8 


14 


APQTGIVDECCFR (6), GFYFNKPTGYGSSSR (6), GPETLCGAELVDALQFVCGDR (6) 


a»P508 


1 9 




^ APQTGIVDECCFR (11), GPETLCGAELVDALQFVCGDR (10-11) 


1 

CPP 508 


9 


12 


APQTGIVDECCFR (1,7. 8. 9). GFYFNKPTGYGSSSR (1. 7. 8). 
GPETLCGAELVDALQFVCGDR (1, 7. 8. 9, 10, tl. 12), RAPQTGIVDECCFR (7-8) 


CPP 508 


9 


13 1 APQTGIVDECCFR (8-9). GPETLCGAELVDALQFVCGDR (8-9). RAPQTGIVDECCFR (8) 


CPP SOS 


9 


14 


GFYFNKPTGYGSSSR (7). GPETLCGAELVDALQFVCGDR (7), RAPQTGIVDECCFR (6-7) 


CPP 508 


9 


15 


GFYFNKPTGYGSSSR (7). RAPQTGIVDECCFR (7) 


CPP 508 


9 


16 


GFYFNKPTGYGSSSR (5) 


CPP 508 


^ 1 




RAPQTGIVDECCFR (3) 






\ 


APQTGIVDECCFR (10-12). GFYFNKPTGYGSSSR (8-10, 12). j 


1 

CPP 508 


\ 

10 


12 


9, 12). RAPQTGIVDECCFR (8). RLEMYCAPLKPAK (7) 


CPP 508 


10 


13 


GFYFNKPTGY^SSR (7), GPETLCGAELVDALQFVCGDR (7-8. 12) 


CPP 508 1 
I . , 


1 11 


10 


GFYFNKPTGYGSSSR (11), GPETLCGAELVDALQFVCGDR (9-11) 


i 

CPP 508 


11 


11 


APQTGI>^ECCFR (10-12). GFYFNKPTGYGSSSR (9-11), GPETLCGAELVDALQFVCGDR 
(9-14). LEMYCAPLKPAK (11), RAPQTGIVDECCFR (9-12). RLEMYCAPLKPAK (10) 


CPP 508 


11 


12 


APQTGFVDECCFR (7-8), GPETuCGAELVDALQFVCGDR (7^) 


CPP 508 


11 


13 


APQTGtVDECCFR (8), GPETLCGAELVDALQFVCGDR (8-9) 


CPP 508 


11 


19 


GPETLCGAELVDALQFVCGDR (3) 


CPP 508 


12 


11 


iW'QTGJVDECCFR (8-10), GFYFNKPTGYGSSSR (8^), GPETLCGAELVDALQFVCGDR 
(8-10), LEMYCAPLKPAK (9). RAPQTGiVDECCFR (9). RLEMYCAPLKPAK (9) 


CPP5C8 


'!2 


12 


APOTGIVDECCFR (7-8), GFYFNKPTGYGSSSR (7, 10), GPETLCGAELVDW.QFVCGDR 
(7-13). RAPQTGIVDECCOT( (7) 


CPP 508 


12 


13 


APQTGIVDECCFR (12), GFYFNKPTGYGSSSR (11-12), GPETLCGAELVDALQFVCGDR 
(S. 12) 


CPP 508 


12 


14 


RAPQTGIVDECCFR (6) 


CPP 508 


12 


19 


GPETLCGAELVDALQFVCGDR (3) 


CPP 508 : 


12 


20 


GPETLCGAELVDALQFVCGDR (1) 


CPP 508 


13 


11 


1 GFYFNKPTGYGSSSR (8), OTETLCGAELVDALQFVCGDR (9) 


CPP 508 


13 


12 


GPETLCGAELVDALQFVCGDR (8) 


CPP 508 


13 

1 


13 


<^ETLCGAELVDALQFVCGDR (8-9) 
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Table 2 


CPP# 1 


CEX : RP1 1 Tryptic ^uences (RP2) 


\ , APQTGIVDECCFR (9), GFYFNKPTGYGSSSR (9). GPETLCGAELVDALQhVCUDK (8-10), 
CPP508 1 14 ' 10 ii RAPQTGtVDECCFR (8) 


CPP508 [ 14 * 


11 ( 


GPm.C6A£tVOALQFVCGDR (9, 11) 


CPPS08 I 14 


12 


APQTGJVDECCFR (8). GPETLCGAELVDALQFVCGD.H (7-8) 


CPP508 j 


14 


13 


GPETLCGAELVDALQFVCGDR (8) 


CPPSOS 


15 


10 


APOfTGIVDECCFR (8). GPETLCGAELVOALOFVCGOR <S-10) 


CPP 508 


15 


11 


APQTGIVDECCFR (8^). GPEU.CGAELVDALQFVCGOR (8-9) 


CPP508 


IS 


12 


GFYFNKPTGYGSSSR (7>. GPETLCGAELVDALQFVCGDR (7-8, 10) 


CPP 508 


15 


13 


GPETLCGAELVDALQFVCGDR (8) 


CPPSOS 


16 


11 


GPETLCGAELVDALQFVCGDR (9) 


CPPSOS 


16 


12 


APQTGIVDECCFR (6). GFYFNKPTGYGSSSR (6). GPETLCGAELVDALQFVCGDR (M) 


CPP 508 


16 


13 


GPETLCGAELVDALQFVCGDR (7) 


CPPSOS 


17 


11 


APQTGIVDECCFR (8). GFYFNKPTGYGSSSR (8). GPETLCGAELVDALQFVCGDR (7-8) 


CPPSOS 


17 


12 


! APQTGfVDECCFR (6). GFYFNKPTGYC^SSR (6), GPETLCCS/^LVDALQFVCGDR (6-7) 


CPPSOS 


17 


13 ; APQTGrVDECCFR (6). GFYFNKPTGYGSSSR (6), GPETLCGAELVDALQFVCGDR (6-7) 


CPPSOS 


17 


15 ! 


' GPETLCGAELVDALQFVCGDR (5) 


CPP 508 


17 


20 


GPETLCGAELVDALQFVCGDR (1) 


CPP 508 


18 


11 


GPETLCGAELVDALQFVCGDR (8) 


CPPSOS 


18 


12 


GPETLCGAELVDALQFVCGDR (6, 8) 


CPP SOS 


18 


13 


GPETLCGAELVDALQFVCGDR (6-7) 


\ CPPSOS 
CPR50& 


18 


16 

B~ 


GPETLCGAELVDALQFVCGDR (2) J 
VPFNGQDPVK (10) 


CPPSOS 


6 


8 


AQEPVKGPVSTKPGSCPJiUR (7^), CAMLNPPNR (7-8). CLKDTDCPGIK (7), 
VPFNGQDPVK (7). VPFNGQDPVKGQVSVK (7) 


CPP5(^ 


8 


8 I AQEPVKGPVSTKPGSCPllUR(7),CAMa-NPPNR(7) 


CPP 509 


9 


8 1 VPFNGQDPVK (7) 


CPP 509 

1 1 


10 


S j GPVSTKPGSCPIiUR (8) 



Table 2 details, for each CPP, tiie sequences detected by msLSS spectrometry according to As 
procedures described in Example 1 . In addition. Table 2 indicates in which fractions of flie (HEX, 
RPl, and RP2 chromatographies each sequence was foimd. 

5 

The CPPs listed in Table 2 were all idraiified as differentially expressed betweei individuals 
with cardiovascular disordm and control individuals using the procedure described in ExaxiQ)le 1 . hi 
particular, each CPP listed in Table 2 was found to vary between Ihe control and disease samples as 
detailed in Table 3 below. 

10 

Tables 
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Table 3 


tr 


wird»tiwn VI vaiiauwii 


CPP8 


Idenbfied En Disease only 


CPP 12 


Identified at a higher tevel ui Dsease 


CPP 13 


tdenfified at a higher tevel In Controls 


CPP 14 


Identic at a higher tevel bi Disease 


CPP 15 


idene^ed at a higher tavel fn Disease 


CPP 16 


Idenffied at a higher level in Disease 


CPP 18 


IdonBDsd at a higher tevel In Disease 


CPP 19 


IdenBBsd in Conbols onty 


CPP 40 


IdenSfled at a h^her level fn Controls 


CPP 41 


IdentiBed at a higher Cevel In Controls 


CPP 149 


Identifed In Disease only 


CPP 150 


Identic at a higher tevel in Disease 


CPP 151 


Identifed at a higher level in Disease 


CPP 501 


Identified at a higher Eevel in Disease 


CPP 502 


idenOfled at a holier £evel In ContnoSs 


CPP 503 


Identified at a higher Eevel tn Controls 


CPP 504 


Iden^ed at a iitgher lev^ in Controls 


CPP 505 


Idsnfifled at a hfghsr level In Disease 


CPP SOS 


Idenfified at a iitgher level in Disease 


CPP 507 


Identified at a higher Eevei In Disease 


CPP 508 


ldenttf.ed in D'sease c»ily 



One skilled in the art can use CPPs of the invention with a number of additional CFPs from 
Table 2, chos^ using a suitable analysis of the levels of the CPPs from Table 2 measured in a number 
of diseased individuals and control individuals through die methods of Example 1. The strategies for 
discovering such combinations of CPPs need to regard each CPP as one variable and the disease as a 
joint, nxulti-variate efiect caused by interaction of these variables. 

Lin^ Discriminant Analysis (LDA) is one such analysis procedure, which can be used to 
detect significant association betwe^ a cluster of variables (i.e. CPPs) and cardiovascular diseases. In 
performing IDA, a set of weights is associated with each variable (i.e. CPP) so that the linear 
combination of weights and the measured values of the variables can identify Ihe disease state by 
discriminating between subjects having a cardiovascular diease and subjects free from cardivascular 
diseases. Enhancements to the LDA allow stepwise inclusion (or removal) of variables to optimize the 
discriminant power of the model. The results of the LDA is therefore a cluster of CPPs which can be 
used without limitations for diagnosis, prognosis, ther^y or drug development Other enhanced 
versions of LDA, such as Flexible Discriminant Analysis permit the use of non-linear combinations of 
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variables to discriminate a disease state from a normal state. The results of the discriminant analysis 
can be vmfied by post-hoc tests and also by repeating the analysis using alternative techniques such 
as classification trees. 



Drug screening assess 

The invention provides a method (also referred to h^ein as a "screening assay") for 
identifying candidate modulators (e.g., small molecules and peptides, antibodies, peptidomimetics or 
oHxer drugs) which bind to CPPs, have a modulatory effe^rt on, for example, CPP expression or 
preferably CPP biological activity, hi some embodiments small molecules can be generated using 
combinatorial chemisty or can be obtained from a natural producte library. Assays may be cell based 
or non-cell based assays. Drug screening assays maybe binding assays or more preferentially 
functional assays, as iurth^ described. 

When the invention is used for drug development, e.g., to determine the abihty of a CPP 
modulator ot drug candidate to induce an anti-cardiovascular disorder response, the body fluid 
analysd for the level of at least one CPP is preferably from a non-human mflTmnal The non-hxmian 
mammal is preferably one in which &e induction of an anti-cardiovascular disorder response by 
endogenous and/or exogenous agents is predictive of the induction of such a response in a human. 

Agents tiiat are found, using screening assays as further described herein, to modulate CPP 
activity by at least 5%, more preferably by at least 10%, still more preferably by at least 30%, still 
more preferably by at least 50%, still more preferably by at least 70%, even more preferably by at 
least 90 %, may be selected for fiirflier testing as a prophylactic and/or therapeutic anti-cardiovascular 
disease agent. 

In another aspect, agents that are found, using screening assays as furdier described herein, to 
modulate CPP expression by at least 5%, more preferably by at least 10%, still more preferably by at 
least 30%, stiU more preferably by at least 50%, still more preferably by at least 70%, even more 
preferably by at least 90 %, may be selected for further testing as a prophylactic and/or therapeutic 
anti-cardio\^iscular disease agent 

Agents that are found to modulate CPP activity may be used, for example, to modulate 
treatment regimens for cardiovascular disorders, or to reduce the symptoms of a cardiovascular 
disorder alone or in combination with other appropriate agents or treatments. 

Protem array methods are useful for screening and drug discovery. For example, one member 
of a receptor^ ligand pan- is docked to an adsorboit, and its ability to bind the binding partner is 
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